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5T32HG and BIR9214821) and the National Institutes of Health (NIH grants 
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provisiona\application serial no. 60/097,788, filed August 25, 1998, and serial no. 
60/099, 113\filed September 3, 1998, all of which are incorporated in their entirety by 
reference herein. 



BACKGROUND OF THE INVENTION 
5j5 Genomic technology has advanced to a point at which, in principle, it has 

m become possible to determine complete genomic sequences and to quantitatively 
2 measure the mRNA levels for each gene expressed in a cell. For some species the 
fl complete genomic sequence has now been determined, and for one strain of the 
yeast Saccharomyces cervisiae, the mRNA levels for each expressed gene have 
20 been precisely quantified under different growth conditions (Velculescu et al., 1997). 
Comparative cDNA array analysis and related technologies have been used to 
determine induced changes in gene expression at the mRNA level by concurrently 
monitoring the expression level of a large number of genes (in some cases all the 
genes) expressed by the investigated cell or tissue (Shalon et al., 1996). 
25 Furthermore, biological and computational techniques have been used to correlate 
specific function with gene sequences. The interpretation of the data obtained by 
these techniques in the context of the structure, control and mechanism of biological 
systems has been recognized as a considerable challenge. In particular, it has been 
extremely difficult to explain the mechanism of biological processes by genomic 
30 analysis alone. 



Proteins are essential for the control and execution of virtually every biological 
process. The rate of synthesis and the half-life of proteins and thus their expression 



level are also controlled post-transcriptionally. Furthermore, the activity of proteins is 
frequently modulated by post-translational modifications, in particular protein 
phosphorylation, and dependent on the association of the protein with other 
molecules including DNA and proteins. Neither the level of expression nor the state 
5 of activity of proteins is therefore directly apparent from the gene sequence or even 
the expression level of the corresponding mRNA transcript. It is therefore essential 
that a complete description of a biological system include measurements that 
indicate the identity, quantity and the state of activity of the proteins which constitute 
the system. The large-scale (ultimately global) analysis of proteins expressed in a 
10 cell or tissue has been termed proteome analysis (Pennington et al., 1997). 

It present no protein analytical technology approaches the throughput and 
level of automation of genomic technology. The most common implementation of 
proteome^analysis is based on the separation of complex protein samples most 
@g commonly By two-dimensional gel electrophoresis (2DE) and the subsequent 
4f5 sequential identification of the separated protein species (Ducret et al., 1998; 

Garrels et al.,\997; Link et al., 1997; Shevchenko et al., 1996; Gygi et al. 1999; 
> Boucherie et al.\1996). This approach has been revolutionized by the development 
of powerful mass^pectrometric techniques and the development of computer 
algorithms which correlate protein and peptide mass spectral data with sequence 
fe?0 databases and thus rapidly and conclusively identify proteins (Eng et al., 1994; 

Mann and Wilm, 1994-Vates et al., 1995). This technology has reached a level of 
sensitivity which now perrnits the identification of essentially any protein which is 
detectable by conventionaPorotein staining methods including silver staining (Figeys 
and Aebersold, 1998; Figeys \t al., 1996; Figeys et al., 1997; Shevchenko et al., 
25 1996). However, the sequentianrianner in which samples are processed limits the 
sample throughput, the most sensitive methods have been difficult to automate and 
low abundance proteins, such as regulatory proteins, escape' detection without prior 
enrichment, thus effectively limiting the dynamic range of the technique. In the 
2DE/(MS)N method, proteins are quanri(ied by densitometry of stained spots in the 
30 2DE gels. 

The development of methods and insb\U mentation for automated, data- 
dependent electrospray ionization (ESI) tandem mass spectrometry (MS n ) in 
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conjunction with microcapillary liquid chromatography (pLC) and database searching 
has\significantly increased the sensitivity and speed of the identification of gel- 
sepa\ated proteins. As an alternative to the 2DE/MS n approach to proteome 
analysis, the direct analysis by tandem mass spectrometry of peptide mixtures 
generated by the digestion of complex protein mixtures has been proposed (Dongr'e 
et al., 1997). pLC-Ms/MS has also been used successfully for the large-scale 
identification of individual proteins directly from mixtures without gel electrophoretic 
separation (Link et al., 1999; Opitek et al., 1997) While these approaches 
dramatically accelerate protein identification, the quantities of the analyzed proteins 
cannot be easily determined, and these methods have not been shown to 
substantially allteviate the dynamic range problem also encountered by the 
2DE/MS/MS approach. Therefore, low abundance proteins in complex samples are 
also difficult to analyze by the uLC/MS/MS method without their prior enrichment. 

It is therefore apparent that current technologies, while suitable to identify the 
components of protein mixtures, are neither capable of measuring the quantity nor 
the state of activity of the protein in a mixture. Even evolutionary improvements of 
the current approaches are unlikely to advance their performance sufficiently to 
make routine quantitative and functional proteome analysis a reality. 

This invention provides methods and reagents that can be employed in 
proteome analysis which overcome the limitations inherent in traditional techniques. 
The basic approach described can be employed for the quantitative analysis of 
protein expression in complex samples (such as cells, tissues, and fractions 
thereof), the detection and quantitation of specific proteins in complex samples, and 
the quantitative measurement of specific enzymatic activities in complex samples. 

In this regard, a multitude of analytical techniques are presently available for 
clinical and diagnostic assays which detect the presence, absence, deficiency or 
excess of a protein or protein function associable with a normal or disease state. 
While these techniques are quite sensitive, they do not necessarily provide chemical 
speciation of products and may, as a result, be difficult to use for assaying several 
proteins or enzymes simultaneously in a single sample. Current methods may not 
distinguish among aberrant expression of different enzymes or their malfunctions 
which lead to a common set of clinical symptoms. The methods and reagents 



herein can be employed in clinical and diagnostic assays for simultaneous 
(multiplex) monitoring of multiple proteins and protein reactions. 

SUMMARY OF THE INVENTION 
This invention provides analytical reagents and mass spectrometry-based 
methods using these reagents for the rapid, and quantitative analysis of proteins or 
protein function in mixtures of proteins. The analytical method can be used for 
qualitative and particularly for quantitative analysis of global protein expression 
profiles in cells and tissues, i.e. the quantitative analysis of proteomes. The method 
can also be employed to screen for and identify proteins whose expression level in 
cells, tissue or biological fluids is affected by a stimulus (e.g., administration of a 
drug or contact with a potentially toxic material), by a change in environment (e.g., 
nutrient level, temperature, passage of time) or by a change in condition or cell 
state (e.g., disease state, malignancy, site-directed mutation, gene knockouts) of the 
cell, tissue or organism from which the sample originated. The proteins identified in 
such a screen can function as markers for the changed state. For example, 
comparisons of protein expression profiles of normal and malignant cells can result 
in the identification of proteins whose presence or absence is characteristic and 
diagnostic of the malignancy. 

In an exemplary embodiment, the methods herein can be employed to screen 
for changes in the expression or state of enzymatic activity of specific proteins. 
These changes may be induced by a variety of chemicals, including pharmaceutical 
agonists or antagonists, or potentially harmful or toxic materials. The knowledge of 
such changes may be useful for diagnosing enzyme-based diseases and for 
investigating complex regulatory networks in cells. 

The methods herein can also be used to implement a variety of clinical and 
diagnostic analyses to detect the presence, absence, deficiency or excess of a given 
protein or protein function in a biological fluid (e.g., blood), or in cells or tissue. The 
method is particularly useful in the analysis of complex mixtures of proteins, i.e., 
those containing 5 or more distinct proteins or protein functions. 



The inventive method employs affinity-labeled protein reactive reagents that 
allow for the selective isolation of peptide fragments or the products of reaction with 
a given protein (e.g., products of enzymatic reaction) from complex mixtures. The 
isolated peptide fragments or reaction products are characteristic of the presence of 
a protein or the presence of a protein function, e.g., an enzymatic activity, 
respectively, in those mixtures. Isolated peptides or reaction products are 
characterized by mass spectrometric (MS) techniques. In particular, the sequence 
of isolated peptides can be determined using tandem MS (MS n ) techniques, and by 
application of sequence database searching techniques, the protein from which the 
sequenced peptide originated can be identified. The reagents also provide for 
differential isotopic labeling of the isolated peptides or reaction products which 
facilitates quantitative determination by mass spectrometry of the relative amounts 
of proteins in different samples. Also, the use of differentially isotopically-labeled 
reagents as internal standards facilitates quantitative determination of the absolute 
amounts of one or more proteins or reaction products present in the sample. 

in general, the affinity labeled protein reactive reagents of this invention have 
three portions: an affinity label (A) covalently linked to a protein reactive group 
(PRG) through a linker group (L): 

\ A-L-PRG 
The linker maySbe differentially isotopically labeled, e.g., by substitution of one or 
more atoms in tnte linker with a stable isotope thereof. For example, hydrogens can 
be substituted witfvdeuteriums or C 12 with C 13 . 

The affinity labekA functions as a molecular handle that selectively binds 
covalently or non-covalently, to a capture reagent (CR). Binding to CR facilitates 
isolation of peptides, substrates or reaction products tagged or labeled with A. In 
specific embodiments, A is a\strepavidin or avidinn. After affinity isolation of affinity 
tagged materials, some of whi^h may be isotopically labeled, the interaction between 
A and the capture reagent is disrupted or broken to allow MS analysis of the isolated 
materials. The affinity label may W displaced from the capture reagent by addition 
of displacing ligand, which may beVee A or a derivative of A, or by changing 
solvent (e.g., solvent type or pH) or temperature conditions or the linker may be 



cleaved chemically, enzymatically, thermally or photochemically to release the 
isolated materials for MS analysis. 



Two types of PRG groups are specifically provided herein: (a) those groups 
that selectively react with a protein functional group to form a covalent or non- 
covalent bond tagging the protein at specific sites, and (b) those that are 
transformed by action of the protein, e.g., that are substrates for an enzyme. In 
specific embodiments, PRG is a group having specific reactivity for certain protein 
groups, such as specificity for sulfhydryl groups, and is useful in general for 
selectively tagging proteins in complex mixtures. A sulfhydryl specific reagent tags 
proteins containing cysteine. In other specific embodiments, PRG is an enzyme 
substrate that is selectively cleaved (leaving A-L) or modified (giving A-L-PRG') by 
the action of an enzyme of interest. 

Exemplary reagents have the general formula: 

A-B 1 -X 1 -(CH 2 ) n -[X 2 -(CH 2 ) m ] x -X 3 -(CH 2 ) p -X 4 -B 2 -PRG 

where: 

A is the affinity label; 

PRG is the protein reactive group; 

X 1 , X 2 , X 3 and X 4 , independently of one another, and X 2 independently of 
other X 2 in the linker group, can be selected from O, S, NH, NR, NRR'\ CO, 
COO, COS, S-S, SO, S0 2 , CO-NR', CS-NR\ Si-O, aryl or diaryl groups orX 1 - 
X 4 may be absent, but preferably at least one of X 1 -X 4 is present; 
B 1 and B 2 , independently of one another, are optional moieties that can 
faciliate bonding of the A or PRG group to the linker or prevent undesired 
cleavage of those groups from the linker and can be selected, for example, 
from COO, CO, CO-NR', CS-NR' and may contain one or more CH 2 groups 
alone or in combination with other groups, e.g.( CH 2 ) q -CONR\ (CH 2 ) q -CS- 
NR'.or (CH 2 ) q ; 

n, m, p and q are whole numbers that can have values from 0 to about 100, 
preferably one of n, m, p or q is not 0 and x is also a whole number that can 
range from 0 to about 100 where the sum of n+xm+p+q is preferably less 
than about 100 and more preferably less than about 20; 
R is an alkyl, alkenyl, alkynyl, alkoxy or aryl group; and 



R' is a hydrogen, an alkyl, alkenyl, alkynyl, alkoxy or aryl group. 

One or more of the CH 2 groups of the linker can be optionally substituted with 
small (C1-C6) alkyl, alkenyl, or alkoxy groups, an aryl group or can be substituted 
with functional groups that promote ionization, such as acidic or basic groups or 
groups carrying permanent positive or negative charge. One or more single bonds 
connecting CH 2 groups in the linker can be replaced with a double or a triple bond. 
Preferred R and R' alkyl, alkenyl, alkynyl or alkoxy groups are small having 1 to 
about 6 carbon atoms. 

One or more of the atoms in the linker can be substituted with a stable 
isotope to generate one or more substantially chemically identical, but isotopically 
distinguishable reagents. For example, one or more hydrogens in the linker can be 
substituted with deuterium to generate isotopically heavy reagents. 

In an exemplary embodiment the linker contains groups that can be cleaved 
to remove the affinity tag. If a cleavable linker group is employed, it is typically 
cleaved after affinity tagged peptides, substrates or reaction products have been 
isolated using the affinity label together with the CR. In this case, any isotopic 
labeling in the linker preferably remains bound to the protein, peptide, substrate or 
reaction product. 

Linker groups include among others: ethers, polyethers, ether diamines, 
polyether diamines, diamines, amides, polyamides, polythioethers, disulfides, silyl 
ethers, alkyl or alkenyl chains (straight chain or branched and portions of which may 
be cyclic), aryl, diaryl or alkyl-aryl groups. Aryl groups in linkers can contain one or 
more heteroatoms (e.g., N, O or S atoms). 

In one aspect, the invention provides a mass spectrometric method for 
identification and quantitation of one or more proteins in a complex mixture which 
employs affinity labeled reagents in which the PRG is a group that selectively reacts 
with certain groups that are typically found in peptides (e.g.,sulfhydryl, amino, 
carboxy, homoserine lactone groups). One or more affinity labeled reagents with 
different PRG groups are introduced into a mixture containing proteins and the 
reagents react with certain proteins to tag them with the affinity label. It may be 



necessary to pretreat the protein mixture to reduce disulfide bonds or otherwise 
facilitate affinity labeling. After reaction with the affinity labeled reagents, proteins in 
the complex mixture are cleaved, e.g., enzymatically, into a number of peptides. 
This digestion step may not be necessary, if the proteins are relatively small. 
Peptides that remain tagged with the affinity label are isolated by an affinity isolation 
method, e.g., affinity chromatography, via their selective binding to the CR. Isolated 
peptides are released from the CR by displacement of A or cleavage of the linker, 
and released materials are analyzed by liquid chromatography/mass spectrometry 
(LC/MS). The sequence of one or more tagged peptides is then determined by 
MS n techniques. At least one peptide sequence derived from a protein will be 
characteristic of that protein and be indicative of its presence in the mixture. Thus, 
the sequences of the peptides typically provide sufficient information to identify one 
or more proteins present in a mixture. 

Quantitative relative amounts of proteins in one or more different samples 
containing\protein mixtures (e.g., biological fluids, cell or tissue lysates, etc.) can be 
determined\jsing chemically identical, affinity tagged and differentially isotopically 
labeled reagetats to affinity tag and differentially isotopically label proteins in the 
different samples. In this method, each sample to be compared is treated with a 
different isotopicaUy labeled reagent to tag certain proteins therein with the affinity 
label. The treated samples are then combined, preferably in equal amounts, and 
the proteins in the combined sample are enzymatically digested, if necessary, to 
generate peptides. Some of the peptides are affinity tagged and in addition tagged 
peptides originating fromvdifferent samples are differentially isotopically labeled. As 
described above, affinity labeled peptides are isolated, released from the capture 
reagent and analyzed by (LC/MS). Peptides characteristic of their protein origin are 
sequenced using MS n techniques allowing identification of proteins in the samples. 
The relative amounts of a giverk protein in each sample is determined by comparing 
relative abundance of the ions generated from any differentially labeled peptides 
originating from that protein. The Viethod can be used to assess relative amounts of 
known proteins in different samplesk Further, since the method does not require any 
prior knowledge of the type of proteins that may be present in the samples, it can be 
used to identify proteins which are present at different levels in the samples 
examined. More specifically, the method can be applied to screen for and identify 
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proteins which exhibit differential express in cells, tissue or biological fluids. It is also 
possible to determine the absolute amounts of specific proteins in a complex 
mikture. In this case, a known amount of internal standard, one for each specific 
protein in the mixture to be quantified, is added to the sample to be analyzed. The 
internal standard is an affinity tagged peptide that is identical in chemical structure to 
the affinity tagged peptide to be quantified except that the internal standard is 
differentially isotopically labeled, either in the peptide or in the affinity tag portion, to 
distinguish it from the affinity tagged peptide to be quantified. The internal standard 
can be provided in the sample to be analyzed in other ways. For example, a specific 
protein or set of proteins can be chemically tagged with an isotopically-labeled 
affinity taggrng reagent. A known amount of this material can be added to the 
sample to be^analyzed. Alternatively, a specific protein or set of proteins may be 
labeled with heavy atom isotopes and then derivatized with an affinity tagging 
reagent. \ 

Also, it is possible to quantify the levels of specific proteins in multiple 
samples in a single analysis (multiplexing). In this case, affinity tagging reagent s 
used to derivatize proteins present in different affinity tagged peptides from different 
samples can be selectively quantified by mass spectrometry. 

In this aspect of the invention, the method provides for quantitative 
measurement of specific proteins in biological fluids, cells or tissues and can be 
applied to determine global protein expression profiles in different cells and tissues. 
The same general strategy can be broadened to achieve the proteome-wide, 
qualitative and quantitative analysis of the state of modification of proteins, by 
employing affinity reagents with differing specificity for reaction with proteins. The 
method and reagents of this invention can be used to identify low abundance 
proteins in complex mixtures and can be used to selectively analyze specific groups 
or classes of proteins such as membrane or cell surface proteins, or proteins 
contained within organelles, sub-cellular fractions, or biochemical fractions such as 
immunoprecipitates. Further, these methods can be applied to analyze differences 
in expressed proteins in different cell states. For example, the methods and 
reagents herein can be employed in diagnostic assays for the detection of the 



presence or the absence of one or more proteins indicative of a disease state, such 
as cancer. 

In a second aspect, the invention provides a MS method for detection of the 
presence or absence of a protein function, e.g., an enzyme activity, in a sample. 
The method can also be employed to detect a deficiency or excess (over normal 
levels) of protein function in a sample. Samples that can be analyzed include 
various biological fluids and materials, including tissue and cells. In this case, the 
PRG of the affinity labeled reagent is a substrate for the enzyme of interest. Affinity 
labeled substrates are provided for each enzyme of interest and are introduced into 
a sample where they react to generate affinity labeled products, if the enzyme of 
interest is present in the sample. Products or unreacted substrate that are tagged 
with the affinity label are isolated by an affinity isolation method, e.g., affinity 
chromatography, via their selective binding to the CR. The isolated tagged 
substrates and products are analyzed by mass spectrometry. Affinity labeled 
products include those in which the substrate is entirely cleaved from the linker or in 
which the substrate is modified by reaction with a protein of interest. Detection of 
the affinity-labeled product indicates the protein function is present in the sample. 
Detection of little or no affinity labeled product indicates deficiency or absence, 
respectively, of the protein function in the sample. 

The amount of selected protein, e.g., measured in terms of enzyme activity, 
present in a sample can be measured by introducing a known amount of an internal 
standard which is an isotopically labeled analog of the expected product of the 
enzymatic reaction of the reagent substrate. The internal standard is substantially 
chemically identical to the expected enzymatic reaction product, but is isotopically 
distinguishable therefrom. The level of protein function (e.g., enzymatic activity) in a 
given sample can be compared with activity levels in other samples or controls 
(either negative or positive controls). The procedure therefore can detect the 
presence, absence, deficiency or excess of a protein function in a sample. The 
method is capable of quantifying the velocity of an enzymatic reaction since it 
enables the amount of product formed over a known time period to be measured. 
This method can be multiplexed, by simultaneous use of a plurality of affinity labeled 
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substrates selective for different protein functions and if quantitation is desired by 
inclusion of the corresponding internal standards for expected products, to analyze 
for a plurality of protein functions in a single sample. 

^ - 

DETAILED DESCRIPTION OF THE INVENTION 
The methods of this invention employ affinity tagged protein reactive reagents 
in which the affinity tag is covalently attached to a protein reactive group by a linker. 
The linker can be isotopically labeled to generate pairs or sets of reagents that are 
substantially chemically identical, but which are distinguishable by mass. For 
example a pair of reagents, one of which is isotopically heavy and the other of which 
is isotopically light can be employed for the comparison of two samples one of which 
may be a reference sample containing one or more known proteins in known 
amounts. For example, any one or more of the hydrogen, nitrogen, oxygen or sulfur 
atoms in the linker may be replaced with their isotopically stable isotopes: 2 H, 13 C, 
15 N, 17 O f 18 0 or "S. 

Suitable affinity tags bind selectively either covalently or non-covalently and 
with high affi<iity to a capture reagent (CR). The CR-A interaction or bond should 
remain intact afa^r extensive and multiple washings with a variety of solutions to 
remove non-specifibglly bound components. The affinity tag binds minimally or 
preferably not at all to cbflQponents in the assay system, except CR, and does not 
significantly bind to surfaces^! reaction vessels. Any non-specific interaction of the 
affinity tag with other componentry^ surfaces should be disrupted by multiple 
washes that leave CR-A intact. Furtntec it must be possible to disrupt the interaction 
of A and CR to release peptides, substrat^or reaction products, for example, by 
addition of a displacing ligand or by changing rhe temperature or solvent conditions. 
Preferably, neither CR or A react chemically with ott^er components in the assay 
system and both groups should be chemically stable ^er the time period of an 
assay or experiment. The affinity tag preferably does norhndergo peptide-like 
fragmentation during (MS) n analysis. The affinity label is preferably soluble in the 
sample liquid to be analyzed and the CR should remain soluble ih\toe sample liquid 
even though attached to an insoluble resin such as Agarose. In the case of CR term 
soluble means that CR is sufficiently hydrated or otherwise solvated sucrNtiat it 
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functions properly for binding to A. CR or CR-containing conjugates should not be 
present in the sample to be analyzed, except when added to capture A. 

Examples of A and CR pairs include: 

d-btatin or structurally modified biotin-based reagents, including d-iminobiotin, 
which bind to\proteins of the avidin/streptavidin, which may, for example, be used in 
the forms of strW>avidin-Agarose, oligomeric-avidin-Agarose, or monomeric-avidin- 
Agarose; \ 

any 1,2-diol,\jch as 1,2-dihydroxyethane (HO-CH 2 -CH 2 -OH), and other 1,2- 
dihydroxyalkanes including those of cyclic alkanes, e.g., 1 ,2-dihydroxycyclohexane 
which bind to an alkyl or a^yl boronic acid or boronic acid esters , such as phenyl- 
B(OH) 2 or hexyl-B(OEthyl) 2 which may be attached via the alkyl or aryl group to a 
solid support material, such as^garose; 

maltose which binds to maltose binding protein (as well as any other 
sugar/sugar binding protein pair or nWe generally to any ligand/ligand binding 
protein pairs that has properties discussed above); 

a hapten, such as dinitrophenyl grouft. for any antibody where the hapten 
binds to an anti-hapten antibody that recognizes the hapten, for example the 
dinitrophenyl group will bind to an anti-dinitrophenyMgG; 

a ligand which binds to a transition metal, for example, an oligomeric histidine 
will bind to Ni(ll), the transition metal CR may be used irHhe form of a resin bound 
chelated transition metal, such as nitrilotriacetic acid-chelated Ni(ll) or iminodiacetic 
acid-chelated Ni(ll); \. 

glutathione which binds to glutathione-S-transferase. \^ 

In general, any A-CR pair commonly used for affinity enrichment which meets 
the suitability criteria discussed above. Biotin and biotin-based affinity tags are 
preferred. Of particular interest are structurally modified biotins, such as d- 
iminobiotin, which will elute from avidin or strepavidin columns under solvent 
conditions compatible with ESI-MS analysis, such as dilute acids containing 10-20% 
organic solvent. It is expected that d-iminobiotin tagged compounds will elute in 
solvents below pH 4. d-lminobiotin tagged protein reactive reagents can be 
synthesized by methods described herein for the corresponding biotin tagged 
reagents. 
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displacement ligand, DL, is optionally used to displace A from CR. Suitable 
DLs are nottypjcally present in samples unless added. DL should be chemically 
and enzymatically^taWe in the sample to be analyzed and should not react with or 
bind to components (oth^ttian CR) in samples or bind non-specifically to reaction 
vessel walls. DL preferably doe^nqt undergo peptide-like fragmentation during MS 
analysis, and its presence in sample sfibuld not significantly suppress the ionization 
of tagged peptide, substrate or reaction produl^qonjugates. 

DL itself preferably is minimally ionized during mass spectrometric analysis 
and the formation of ions composed of DL clusters is preferably minimal. The 
selection of DL, depends upon the A and CR groups that are employed. In general, 
DL is selected to displace A from CR in a reasonable time scale, at most within a 
week of its addition, but more preferably within a few minutes or up to an hour. The 
affinity of DL for CR should be comparable or stronger than the affinity of the tagged 
compounds containing A for CR. Furthermore, DL should be soluble in the solvent 
used during the elution of tagged compounds containing A from CR. DL preferably 
is free A or a derivative or structural modification of A. Examples of DL include, d- 
biotin or d-biotin derivatives, particularly those containing groups that suppress 
cluster formation or suppress ionization in MS. 

The linker group (L) should be soluble in the sample liquid to be analyzed and 
it should be stable with respect to chemical reaction, e.g., substantially chemically 
inert, with components of the sample as well as A and CR groups. The linker when 
bound to A should not interfere with the specific interaction of A with CR or interfere 
with the displacement of A from CR by a displacing ligand or by a change in 
temperature or solvent. The linker should bind minimally or preferably not at all to 
other components in the system, to reaction vessel surfaces or CR. Any non- 
specific interactions of the linker should be broken after multiple washes which leave 
the A-CR complex intact. Linkers preferably do not undergo peptide-like 
fragmentation during (MS) n analysis. At least some of the atoms in the linker groups 
should be readily replaceable with stable heavy-atom isotopes. The linker preferably 
contains groups or moieties that facilitate ionization of the affinity tagged reagents, 
peptides, substrates or reaction products. 
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To promote ionization, the linker may contain acidic or basic groups, e.g., 
COOH, SO3H, primary, secondary or tertiary amino groups, nitrogen-heterocycles, 
ethers, or combinations of these groups. The linker may also contain groups having 
a permanent charge, e.g., phosphonium groups, quaternary ammonium groups, 
sulfonium groups, chelated metal ions, tetralky or tetraryl borate or stable 
carbanions. 

The covalent bond of the linker to A or PRG should typically not be 
unintentionally cleaved by chemical or enzymatic reactions during the assay. In 
some cases it may be desirable to cleave the linker from the affinity tag A or from 
the PRG, for example to facilitate release from an affinity column. Thus, the linker 
can be cleavable, for example, by chemical, thermal or photochemical reaction. 
Photocleavable groups in the linker may include the 1-(2-nitrophenyl)-ethyl group. 
Thermally labile linkers may, for example, be a double-stranded duplex formed from 
two complementary strands of nucleic acid, a strand of a nucleic acid with a 
complementary strand of a peptide nucleic acid, or two complementary peptide 
nucleic acid strands which will dissociate upon heating. Cleavable linkers also 
include those having disulfide bonds, acid or base labile groups, including among 
others, diarylmethyl or trimethylarylmethyl groups, silyl ethers, carbamates, 
oxyesters, thiesters, thionoesters, and a-fluorinated amides and esters. 
Enzymatically cleavable linkers can contain, for example, protease-sensitive amides 
or esters, (3-lactamase-sensitive p-lactam analogs and linkers that are nuclease- 
cleavable, or glycosidase-cleavable. 

The protein reactive group (PRG) can be a group that selectively reacts with 
certain protein functional groups or is a substrate of an enzyme of interest. Any 
selectively reactive protein reactive group should react with a functional group of 
interest that is present in at least a portion of the proteins in a sample. Reaction of 
PRG with functional groups on the protein should occur under conditions that do not 
lead to substantial degradation of the compounds in the sample to be analyzed. 
Examples of selectively reactive PRGs suitable for use in the affinity tagged 
reagents of this invention, include those which react with sulfhydryl groups to tag 
proteins containing cysteine, those that react with amino groups, carboxylate 
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groups, ester groups, phosphate reactive groups, and aldehyde and/or ketone 
reactive groups or, after fragmentation with CNBr, with homoserine lactone. 

Thiol reactive groups include epoxides, a-haloacyl group, nitriles, sulfonated 
alkyl or aryl thiols and maleimides. Amino reactive groups tag amino groups in 
proteins and include sulfonyl halides, isocyanates, isothiocyanantes, active esters, 
including tetrafluorophenyl esters, and N-hydroxysuccinimidyl esters, acid halides, 
and acid anyhydrides. In addition, amino reactive groups include aldehydes or 
ketones in the presence or absence of NaBH 4 or NaCNBH 3 . 

Carboxylic acid reactive groups include amines or alcohols in the presence of 
a coupling agent such as dicyclohexylcarbodiimide, or 2,3,5,6-tetrafluorophenyl 
trifluoroacetate and in the presence or absence of a coupling catalyst such as 
4-dimethylaminopyridine; and transition metal-diamine complexes including 
Cu(ll)phenanthroline 

Ester reactive groups include amines which, for example, react with 
homoserine lactone. 

Phosphate reactive groups include chelated metal where the metal is, for 
example Fe(lll) or Ga(lll), chelated to, for example, nitrilotriacetiac acid or 
iminodiacetic acid. 

Aldehyde or ketone reactive groups include amine plus NaBH 4 or NaCNBH 3 , 
or these reagents after first treating a carbohydrate with periodate to generate an 
aldehyde or ketone. 

PRG groups can also be substrates for a selected enzyme of interest. The 
enzyme of interest may, for example, be one that is associated with a disease state 
or birth defect or one that is routinely assayed for medical purposes. Enzyme 
substrates of interest for use with the methods of this invention include, acid 
phosphatase, alkaline phosphatase, alanine aminotransferase, amylase, angiotensin 
converting enzyme, aspartate aminotransferase, creatine kinase, 
gamma-glutamyltransferase, lipase, lactate dehydrogenase, and 
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glucose-6-phosphate dehydrogenase which are currently routinely assayed by other 
methods. 

T^he requirements discussed above for A, L, PRG, extend to the 
corresponding to the segments of A-L-PRG and the reaction products generated 
with this reagent. 

InternaKstandards, which are appropriately isotopically labeled, may be 
employed in theViethods of this invention to measure absolute quantitative amounts 
of proteins in samples. Internal standards are of particular use in assays intended to 
quantitate affinity tagged products of enzymatic reactions. In this application, the 
internal standard is chemically identical to the tagged enzymatic product generated 
by the action of the enzyme on the affinity tagged enzyme substrate, but carries 
isotope labels which mayViclude 2 H, 13 C, 15 N, 17 0, 18 0, or 34 S, that allow it to be 
independently detected by IMS techniques. Internal standards for use in method 
herein to quantitative one or several proteins in a sample are prepared by reaction of 
affinity labeled protein reactive reagents with a known protein to generate the affinity 
tagged peptides generated from oigestion of the tagged protein. Affinity tagged 
peptides internal standards are substantially chemically identical to the 
corresponding affinity tagged peptides^generated from digestion of affinity tagged 
protein, except that they are differentiallylsotopically labeled to allow their 
independent detection by MS techniques. \ 

The method of this invention can also be applied to determine the relative 
quantities of one or more proteins in two or more protein samples, the proteins in 
each sample are reacted with affinity tagging reagents which are substantially 
chemically identical but differentially isotopically labeled. The samples are 
combined and processed as one. The relative quantity of each tagged peptide 
which reflects the relative quantity of the protein from which the peptide originates is 
determined by the measurement of the respective isotope peaks by mass 
spectrometry. 

The methods of this invention can be applied to the analysis or comparison of 
multiple different samples. Samples that can be analyzed by methods of this 
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invention include cell homogenates; cell fractions; biological fluids including urine, 
blood, and cerebrospinal fluid; tissue homogenates; tears; feces; saliva; lavage 
fluids such as lung or peritoneal lavages; mixtures of biological molecules including 
proteins, lipids, carbohydrates and nucleic acids generated by partial or complete 
fractionation of cell or tissue homogenates: 

The methods of this invention employ MS and (MS) n methods. While a 
variety of MS and (MS) n are available and may be used in these methods, Matrix 
Assisted Laser Desorption Ionization MS (MALDI/MS) and Electrospray Ionization 
MS (ESI/MS) methods are preferred. 

Quantitative proteome analysis 

This method is schematically illustrated in Scheme 1 using a biotin labeled 
sulfhydryl-reactive reagent for quantitative protein profile measurements in a sample 
protein mixture and a reference protein mixture. The method comprises the 
following steps: 

Reduction. Disulfide bonds of proteins in the sample and reference mixtures 
are reduced to free SH groups. The preferred reducing agent is tri-n-butylphosphine 
which is used under standard conditions. Alternative reducing agents include 
mercaptoethylamine and dithiothreitol. If required, this reaction can be performed in 
the presence of solubilizing agents including high concentrations of urea and 
detergents to maintain protein solubility. The reference and sample protein mixtures 
to be compared are processed separately, applying identical reaction conditions; 

Derivatization of SH groups with an affinity tag. Free SH groups are 
derivatized with the biotinylating reagent biotinyl-iodoacetylamidyl-4,7,10 
trioxatridecanediamine the synthesis of which is described below. The reagent is 
prepared in different isotopically labeled forms by substitution of linker atoms with 
stable isotopes and each sample is derivatized with a different isotopically labeled 
form of the reagent. Derivatization of SH groups is preferably performed under 
slightly basic conditions (pH 8.5) for 90 min at RT. For the quantitative, comparative 
analysis of two samples, one sample each (termed reference sample and sample) 
are derivatized as illustrated in Scheme 1 with the isotopically light and the 
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isotopically heavy form of the reagent, respectively. For the comparative analysis of 
several samples one sample is designated a reference to which the other samples 
are related to. Typically, the reference sample is labeled with the isotopically heavy 
reagent and the experimental samples are labeled with the isotopically light form of 
the reagent, although this choice of reagents is arbitrary. These reactions are also 
compatible with the presence of high concentrations of solubilizing agents; 

Combination of labeled samples. After completion of the affinity tagging 
reaction defined aliquots of the samples labeled with the isotopically different 
reagents (e.g., heavy and light reagents) are combined and all the subsequent steps 
are performed on the pooled samples. Combination of the differentially labeled 
samples at this early stage of the procedure eliminates variability due to subsequent 
reactions and manipulations. Preferably equal amounts of each sample are 
combined; 

Removal of excess affinity tagged reagent. Excess reagent is adsorbed, for 
example, by adding an excess of SH-containing beads to the reaction mixture after 
protein SH groups are completely derivatized. Beads are added to the solution to 
achieve about a 5-fold molar excess of SH groups over the reagent added and 
incubated for 30 min at RT. After the reaction the beads are be removed by 
centrifugation; 

Rrotein digestion. The proteins in the sample mixture are digested, typically 
with trypsih. Alternative proteases are also compatible with the procedure as in fact 
are chemicaKfragmentation procedures. In cases in which the preceding steps were 
performed in tffie presence of high concentrations of denaturing solubilizing agents 
the sample mixture are diluted until the denaturant concentration is compatible with 
the activity of the proteases used. This step may be omitting in the analysis of 
small proteins; \ 

Affinity isolation of the affinity tagged peptides by interaction with a capture 
reagent. The biotinylated peptides are isolated on avidin-agarose. After digestion the 
pH of the peptide samples is lowered to 6.5 and the biotinylated peptides are 
immobilized on beads coated with monomeric avidin (Pierce). The beads are 
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extensively washed. The last washing solvent includes 10% methanol to remove 
residual SDS. Biotinylated peptides are eluted from avidin-agarose, for example, 
with 0.3% formic acid at pH 2; 

Analysis of the isolated, derivatized peptides by pLC-MS n or CE-MS n with data 
dependent fragmentation. Methods and instrument control protocols well-known in 
the art and described, for example, in Ducret et al., 1998; Figeys and Aebersold, 
1998; Figeys et al., 1996; or Haynes et al., 1998 are used. 

In this last step, both the quantity and sequence identity of the proteins from 
which the tagged peptides originated can be determined by automated multistage 
MS. This is achieved by the operation of the mass spectrometer in a dual mode in 
which it alternates in successive scans between measuring the relative quantities of 
peptides eluting from the capillary column and recording the sequence information of 
selected peptides. Peptides are quantified by measuring in the MS mode the 
relative signal intensities for pairs of peptide ions of identical sequence that are 
tagged with the isotopically light or heavy forms of the reagent, respectively, and 
which therefore differ in mass by the mass differential encoded within the affinity 
tagged reagent. Peptide sequence information is automatically generated by 
selecting peptide ions of a particular mass-to-charge (m/z) ratio for collision-induced 
dissociation (CID) in the mass spectrometer operating in the MS n mode. (Link, A.J. 
et al., 1997; Gygi, S.P., et al. 1999; and Gygi, S.P. et al., 1999). The resulting CID 
spectra are then automatically correlated with sequence databases to identify the 
protein from which the sequenced peptide originated. Combination of the results 
generated by MS and MS n analyses of affinity tagged and differentially labeled 
peptide samples therefore determines the relative quantities as well as the 
sequence identities of the components of protein mixtures in a single, automated 
operation. 

Results of this applying this method using the biotinylated sulfhydryl reagent 
and to the quantitative analysis of synthetic peptide samples, to the relative 
quantitation of Uie peptides in a protein digest an the tandem mass spectral analysis 
of a derivatized oeptide are shown in Fig. 1 , Table 1 , and Fig. 2, respectively. 
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This method can also be practiced using other affinity tags and other protein 
reactive groups, including amino reactive groups, carboxyl reactive groups, or 
groups that react with homoserine lactones. 

The approach employed herein for quantitative proteome analysis is based 
on two principles. First, a short sequence of contiguous amino acids from a protein 
(5-25 residues) contains sufficient information to uniquely identify that protein. 
Protein identification by MS n is accomplished by correlating the sequence 
information contained in the CID mass spectrum with sequence databases, using 
sophisticated computer searching algorithms (Eng, J. et al., 1994; Mann, M. et al., 
1994; Qin, J. et al., 1997; Clauser, K.R. et al., 1995). Second, pairs of identical 
peptides tagged with the light and heavy affinity tagged reagents, respectively, (or in 
analysis of more than two samples, sets of identical tagged peptides in which each 
set member is differentially isotopically labeled) are chemically identical and 
therefore serve as mutual internal standards for accurate quantitation. The MS 
measurement readily differentiates between peptides originating from different 
samples, representing for example different cell states, because of the difference 
between isotopically distinct reagents attached to the peptides. The ratios between 
the intensities of the differing weight components of these pairs or sets of peaks 
provide an accurate measure of the relative abundance of the peptides (and hence 
the proteins) in the original cell pools because the MS intensity response to a given 
peptide is independent of the isotopic composition of the reagents (De Leenheer, 
A.P. et al (1992). The use of isotopically labeled internal standards is standard 
practice in quantitative mass spectrometry and has been exploited to great 
advantage in, for example, the precise quantitation of drugs and metabolites in 
bodily fluids (De Leenheer, A.P. et al., 1992). 

In another illustration of the method, two mixtures consisting of the same six 
proteins at known, but different, concentrations were prepared and analyzed. The 
protein mixtures were labeled, combined and treated as schematically illustrated in 
Scheme 1 . The isolated, tagged peptides were quantified and sequenced in a single 
combined pLC-MS and pLC-MS n experiment on an ESI ion trap mass spectrometer. 
All six proteins were unambiguously identified and accurately quantified (Table 2). 
Multiple tagged peptides were encountered for each protein. The differences 
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between the observed and expected quantities for the six proteins ranged between 2 
and 12%. 

The process is further illustrated for a single peptide pair in Figs. 3A-C. A 
single scan of the mass spectrometer operated in MS mode is shown in Fig. 3A. 
Four pairs of peptide ions characterized by the mass differential encoded in the 
affinity tagged reagent are detected in this scan and indicated with their respective 
m/z values. The scan shown was acquired in 1 .3 s. Over the course of the one- 
hour chromatographic elution gradient, more than 1200 such scans were 
automatically recorded. Fig. 3B shows an expanded view of the mass spectrum 
around the ion pair with m/z ratios of 993.8 and 977.7, respectively. Co-elution and 
a detected mass differential of four units potentially identifies the ions as a pair of 
doubly charged affinity tagged peptides of identical sequence (mass difference of 
eight and a charge state of two). Fig. 3C shows the reconstructed ion 
chromatograms for these two species. The relative quantities were determined by 
integrating the contour of the respective peaks. The ratio (light/heavy) was 
determined as 0.54 (Table 1). The peaks in the reconstructed ion chromatograms 
appear serrated because in every second scan the mass spectrometer switched 
between the MS and the MS n modes to collect sequence information (CID mass 
spectrum) of a selected peptide ion. These CID spectra were used to identify the 
protein from which the tagged peptides originated. Fig. 4A shows the CID spectrum 
recorded from the peptide ion with m/z = 998 (marked with an arrow in Fig. 3A). 
Database searching with this CID spectrum identified the protein as glyceraldehyde- 
3-phosphate dehydrogenase (Fig. 4B) which was a member of the protein mixture. 

Several beneficial features of the this method are apparent. First, at least two 
peptides were detected from each protein in the mixture. Therefore, both 
quantitation and protein identification can be redundant. Second, the identified 
peptides all contained at least one tagged cysteinyl residue. The presence of the 
relatively rare cysteinyl residue in a peptide adds an additional powerful constraint 
for database searching (Sechi, S. et al., 1998). Third, tagging and selective 
enrichment of cysteine-containing peptides significantly reduced the complexity of 
the peptide mixture generated by the concurrent digestion of six proteins. For this 
protein mixture, the complexity was reduced from 293 potential tryptic peptides to 44 
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tryptic peptides containing at least one cysteinyl residue. Fourth, the peptide 
samples eluted from the avidin affinity column are directly compatible with analysis 
by pLC-MS n . 

Quantitative analysis of protein expression in different cell states 

The protein reactive affinity reagent strategy was applied to study differences 
in steady-state<Drotein expression in the yeast, S. cerevisiae, in two non-glucose 
repressed states (TaJ^le 3). Cells were harvested from yeast growing in log-phase 
utilizing either 2% galactcfee^Dr 2% ethanol as the carbon source. One-hundred pg 
of soluble yeast protein from eacFi^eellstate were labeled independently with the 
isotopically different affinity tagged reagents^The labeled samples were combined 
and subjected to the strategy described in Fig. iT^Sqe fiftieth (the equivalent of 
approximately 2 pg of protein from each cell state) of the^aqiple was analyzed. 

Glucose repression causes large numbers of proteins with metabolic 
functions significant to growth on other carbon sources to be minimally expressed 
(Ronne, H., 1995; Hodges, P.E. et al., 1999). Growth on galactose or ethanol with 
no glucose present results in the expression of glucose repressed genes. Table 3 
presents a selection of 34 yeast genes encountered in the analysis, but it contains 
every known glucose-repressed genes that was identified (Mann, M. et al., 1994). 
Each of these genes would have been minimally expressed in yeast grown on 
glucose. Genes specific to both growth on galactose (GAL1 , GAL10) as well as 
growth on ethanol (ADH2, ACH1) were detected and quantitated. 

The quantitative nature of the method is apparent in the ability to accurately 
measure small changes in relative protein levels. Evidence of the accuracy of the 
measurements can be seen by the excellent agreement found by examining ratios 
for proteins for which multiple peptides were quantified. For example, the five 
peptides found from PCK1 had a mean ratio ±95% confidence intervals of 
1.57±0.15, and the percent error was <10%. In addition, the observed changes fit 
the expected changes from the literature (Ronne, H., 1995; Hodges, P.E. et al., 
1999). Finally, the observed changes are in agreement with the changes in staining 
intensity for these same proteins examined after two-dimensional gel electrophoresis 
(data not shown). 
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The alcohol dehydrogenase family of isozymes in yeast facilitates growth on 
either hexose sugars (ADH1) and ethanol (ADH2). The gene ADH2 encodes an 
enzyme that is both glucose- and galactose-repressed and permits a yeast cell to 
grow entirely on ethanol by converting it into acetaldehyde which enters the TCA 
cycle (Fig. 5A). In the presence of sugar, ADH1 performs the reverse reaction 
converting acetaldehyde into ethanol. The regulation of these isozymes is key to 
carbon utilization in yeast (Ronne, H., 1995). The ability to accurately measure 
differences in gene expression across families of isozymes is sometimes difficult 
using cDNA array techniques because of cross hybridization (DeRisi, J.L. et al., 
1997). The method of this invention applied as illustrated in Fig.1 succeeded in 
measuring gene expression for each isozyme even though ADH1 and ADH2 share 
93% amino acid (88% nucleotide) sequence similarity. This was because the affinity 
tagged peptides from each isozyme differed by a single amino acid residue (valine to 
threonine) which shifted the retention time by more than 2 min and the mass by 2 
daltons for the ADH2 peptides (Fig. 5B). ADH1 was expressed at approximately 2- 
fold high levels when galactose was the carbon source compared with ethanol. 
Ethanol-induction of ADH2 expression resulted in more than 200-fold increases 
compared with galactose-induction. 

The results described above illustrate that the method of this invention 
provides quantitative analysis of protein mixtures and the identification of the protein 
components therein in a single, automated operation. 

Th^method as applied using a sulfhydryl reactive reagent significantly 
reduces the complexity of the peptide mixtures because affinity tagged cysteine- 
containing peptides aTBv^electively isolated. For example, a theoretical tryptic digest 
of the entire yeast proteom^Q1 13 proteins) produces 344,855 peptides, but only 
30,619 of these peptides contain a^steinyl residue. Thus, the complexity of the 
mixture is reduced, while protein quantitation and identification are still achieved. 
The chemical reaction in of the sulfhydryl reacjBQt with protein can be performed in 
the presence of urea, sodium dodecyl sulfate (SDS)>§alts and other chemicals that 
do not contain a reactive thiol group. Therefore, protein^o^n be kept in solution with 
powerful stabilizing agents until they are enzymatically digest^d N The sensitivity of 
the mLC-MS" system is dependent of the sample quality. In particular commonly 
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vLised protein solubilizing agents are poorly compatible or incompatible with MS. 
Affinity purification of the tagged peptides completely eliminates contaminants 
incompatible with MS. The quantitation and identification of low abundance proteins 
by cohyentional methods requires large amounts (milligrams) of starting protein 
lysate arid involves some type of enrichment for these low abundance proteins. 
Assays described above, start with about 100 (jg of protein and used no 
fractionation\echniques. Of this, approximately 1/50 of the protein was analyzed in 
a single pLC-MS n experiment. This system has a limit of detection of 10-20 fmol per 
peptide (Gygi, Sj?. et al. f 1999). For this reason, in the assays described which 
employ |jLC-MS n o\ly abundant proteins are detected. However, the methods of this 
invention are compatible with any biochemical, immunological or cell biological 
fractionation methods mat reduce the mixture complexity and enrich for proteins of 
low abundance while quantitation is maintained. This method can be redundant in 
both quantitation and identification if multiple cysteines are detected. There is a 
dynamic range associated with the ability of the method to quantitate differences in 
expression levels of affinity tagged peptides which is dependent on both the 
intensity of the peaks corresponding the peptide pair (or set) and the overall mixture 
complexity. In addition, this dynamic, range will be different for each type of mass 
spectrometer used. The ion trap wasWiployed in assays described herein because 
of its ability to collect impressive amounts of sequencing information (thousands of 
proteins can potentially be identified) in a data-dependent fashion even though it 
offers a more limited dynamic quantitation ratoge. The dynamic range of the ion trap 
(based on signal-to-noise ratios) varied depending on the signal intensity of the 
peptide pair and complexity of the mixture, but differences of up to 100-fold were 
generally detectable and even larger differences could be determined for more 
abundant peptides. In addition, protein expression leveJ changes of more than 100- 
200-fold still identify those proteins as major potential contributors to the phenotypic 
differences between the two original cell states. The method can be extended to 
include reactivity toward other functional groups. A small percentage of proteins 
(8% for S. cerevisiae) contain no cysteinyl residues and are theKefore missed by 
analysis using reagents with sulfhydryl group specificity (i.e., thiol group specificity). 
Affinity tagged reagents with specificities toward functional groups other than 
sulfhydryl groups will also make cysteine-free proteins susceptible to Analysis. 
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The methods of this invention can be applied to analysis of low abundance 
proteins and classes of proteins with particular physico-chemical properties including 
poor solubility, large or small size and extreme p/ values. 

The prototypical application of the chemistry and method is the establishment 
of quantitative profiles of complex protein samples and ultimately total lysates of 
cells and tissues following the preferred method described above. In addition the 
reagents and methods of this invention have applications which go beyond the 
determination of protein expression profiles. Such applications include the following: 

Application of amino-reactive or sulfhydryl-reactive, differentially isotopically 
labeled affinity tagged reagents for the quantitative analysis of proteins in immuno 
precipitated complexes. In the preferred version of this technique protein complexes 
from cells representing different states (e.g., different states of activation, different 
disease states, different states of differentiation) are precipitated with a specific 
reagent, preferably an antibody. The proteins in the precipitated complex are then 
derivatized and analyzed as above. 

Application of amino-reactive, differentially isotopically labeled affinity tagged 
reagents to determine the sites of induced protein phosphorylation. In a preferred 
version of this method purified proteins (e.g., immunoprecipitated from cells under 
different stimulatory conditions) are fragmented and derivatized as described above. 
Phosphopeptides are identified in the resulting peptide mixture by fragmentation in 
the ion source of the ESI-MS instrument and their relative abundances are 
determined by comparing the ion signal intensities of the experimental sample with 
the intensity of an included, isotopically labeled standard. 

^mino-reactive, differentially isotopically labeled affinity tagged reagents are 
used to identify the A/-terminal ion series in MS n spectra. In a preferred version of 
this application\the peptides to be analyzed are derivatized with a 50:50 mixture of 
an isotopically lignt.and heavy reagent which is specific for amino groups. 
Fragmentation of the'peptides by CID therefore produce two /V-terminal ion series 
which differ in mass precisely by the mass differential of the reagent species used. 
This application dramatically^educes the difficulty in determining the amino acid 
sequence of the derivatized peptide. 
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Quantitative Analysis of Surface Proteins in Cells and Tissue 

The cell exterior membrane and its associated proteins (cell surface proteins) 
participate in sensing external signals and responding to environmental cues. 
Changes in the abundance of cell surface proteins can reflect a specific cellular 
state or the ability of a cell to respond to its changing environment. Thus, the 
comprehensive, quantitative characterization of the protein components of the cell 
surface can identify marker proteins or constellations of marker proteins 
characteristic for a particular cellular state, or explain the molecular basis for cellular 
responses to external stimuli. Indeed, changes in expression of a number of cell 
surface receptors such as Her2/neu, erbB, IGFI receptor, and EGF receptor have 
been implicated in carcinogenesis and a current immunological therapeutic 
approach for breast cancer is based on the infusion of an antibody (Herceptin, 
Genentech, Palo Alto, CA) that specifically recognizes Her2/neu receptor. 

Cell surface proteins are also experimentally accessible. Diagnostic assays 
for cell classification and preparative isolation of specific cells by methods such as 
cell sorting or panning are based on cell surface proteins. Thus, differential analysis 
of cell surface proteins between normal and diseased (e.g., cancer) cells can identify 
important diagnostic or therapeutic targets. While the importance of cell surface 
proteins for diagnosis and therapy of cancer has been recognized, membrane 
proteins have been difficult to analyze. Due to their generally poor solubility they 
tend to be under-represented in standard 2D gel electrophoresis patterns and 
attempts to adapt 2D electrophoresis conditions to the separation of membrane 
proteins have met limited success. The method of this invention can overcome the 
limitations inherent in the traditional techniques. 

The analysis of membrane proteins is challenging because they generally are 
difficult to maintain in solution under conditions that are compatible with high 
sensitivity analytical instruments such as mass spectrometers. The application of 
the methods of the present invention to the analysis of membrane proteins is 
exemplified using human T cell lymphoma cell line Jurkat for membrane protein 
labeling and extraction and the well characterized human prostate epithelial cell line 
P69SV40T and two P69SV40T sublines which differ in IGF-1 receptor expression by 
factor of 10 to exemplify quantitative, differential analysis of membrane proteins. 
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Jurkat cells are an appropriate model system because the cells are easy to 
grow in large numbers and because the modulation of cell surface proteins in 
response to different stimuli and experimental conditions has been well 
characterized in T lymphocytes. Commercially available biotinylating reagents or 
more generally affinity tagging reagents are employed to derivatize lysine residues 
and the free N-termini. Water soluble biotinylating reagents such as Sulfo-NHS (N- 
hydroxy succinimide) biotin and analogs (Sulfosuccinimidyl-6-(biotinamido)- 
hexanoate, Pierce, Rockford, IL) which have been used extensively for labeling cell 
surface proteins can be employed.. The reaction of NHS esters with primary amines 
is best at neutral pH values and above and is compatible with the presence of 
organic solvent such as DMSO or DMF. Biotinylation of cell surface proteins from 
the Jurkat cells is carried out in PBS buffer at pH 7.2. Cells (1 x 10 7 ) are washed 
with PBS buffer to remove contaminating serum and other proteins from the culture 
medium. The cells are resuspended at 25 x 10 6 cell/ml and reacted with 0.5 mg/ml 
of Sulfo-NHS-Biotin (Pierce, Rockford, IL) for 30 min at RT. The labeled cells are 
washed twice with cold PBS to remove unreacted biotinylating reagent. Biotinylated 
cells are solubilized at 5 x 10 7 cells/ml in lysis buffer containing 1 % Triton X-1 14. 
Triton X-1 14 has the property of phase-partitioning into detergent phase and 
aqueous phase at 30°C. Following the phase partitioning, detergent phase is 
removed from the aqueous phase by centrifugation at 300xg. Phase partitioning has 
previously been successfully used to enrich cell membrane. Also, this technique 
was found to enrich membrane proteins from Jurkat cell lysates . Triton phase is 
diluted 1:5 (v/v) using 50 mM ammonium bicarbonate buffer, pH 8.5, and high-purity, 
modified porcine-trypsin is added to digest the proteins at a concentration of 12.5 
ng/ml for overnight at 37 °C. Trypsin is neutralized by the addition of a cocktail of 
serine protease inhibitors and tryptic peptides are isolated by the avidin affinity 
chromatography techniques. Eluted peptides are separated e.g., by pLC methods 
and identified by searching peptide sequence databases, using for example, the 
Sequest program. 

The human prostate epithelial cell line P69SV40T which was immortalized 
with SV 40 T antigen has been well characterized . This cell line is immortal but not 
tumorigenic and expresses type 1 insulin like growth factor receptor (IGF-1R) at 2 x 
10 4 receptors per cell. A subline, called M12, was derived from P69SV40T by 
sequential passage in male athymic nude mice . This cell line is highly tumorigenic 
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and metastatic and expresses 1.1 x 10 3 IGF-1R per cell. The relative difference in 
the abundance of IGF-1R in the cell lines P69SV40T and M12 can be quantitatively 
determined using methods of this invention adapted for application to membrane 
proteins. Since the number of IGF-1 R for these cell lines has already been 
determined, this well characterized system can provide a reference to validate the 
efficiency of the quantitative methods of this invention 

S P69SV40T cells (1 x 10 7 ) are biotinylated with an isotopically heavy biotin 
tagged arwio reactive reagent and the M12 cells (1 x 10 7 ) are biotinylated with a 
correspondingNsotopically light amine reactive biotin tagged amino reactive reagent. 
IGF-1 R is then immunoprecipitated from the combined lysate of both cell lines using 
an antibody against nbman IGF-1 R and the total mass of immunoprecipitated 
proteins is digested with tlypsin. Trypsin is then neutralized, e.g., by the addition of 
inhibitors and tagged peptide^re purified by biotin-avidin affinity chromatography. 
The eluted peptides are analyzec!kby LC-MS and LC-MS N for peptide quantitation 
and identification, respectively, as has been described above. Quantitation in this 
experiment is facilitated by the option tofc^e selective ion monitoring in the MS. In 
this mode only the masses of tagged peptideSons expected to derive from IGF-1 R 
need be monitored. 

The described technique can be applied to compare the differences in relative 
abundance of cell surface proteins between parental prostate cell line (P69SV40T) 
and M12 cells to detect and identify those cell surface proteins whose expression 
level is different in the two cell lines and which may be characteristic of the different 
cell states. Using the methods described herein global, relative quantitation of the 
cell surface proteins in any two or more cell lines can be analysed to detect and 
identify those cell surface proteins characteristic of the different cell states. Results 
can be independent confirmed using procedure such as 1 D or 2[J\gels, if applicable, 
or quantitative western blotting to confirm quantitation results. \ 

It is expected that the experimental variability of quantitation of cell surface 
proteins will be considerably better than the accuracy of quantitation achieved by 
currently available cDNA array technology. In addition to relative protein quantity 
and identity, the method can also be used to reveal the orientation of the protein in 
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the membrane, based on the presumption that intact, alive cells will exclude the 
biotinylating reagent. 

Alternative methods can be applied to enhance the selectivity for tagged 
peptides derived from cell surface proteins. For example, tagged cell surface 
proteins can be trypsinized directly on the intact cells to generate tagged peptides, 
purified and analyzed as discussed. In addition, traditional cell membrane 
preparations may be used as an initial step to enrich cell surface proteins. These 
methods can include gentle cell lysis with a dounce homogenizer and series of 
density gradient centrifugations to isolate membrane proteins prior to proteolysis.. 
This method can provide highly enriched preparations of cell surface proteins. 
Affinity tagged proteins may also be isolated by affinity chromatography prior to 
proteolysis as well as after proteolysis. This chromatography can be performed in 
the presence of surfactants such as TX-100, NP-40 or Tween-20 to maintain protein 
solubility. The sequential application of affinity chromatography steps (one for the 
intact protein and one for the tagged peptide fragments) provides a high degree of 
selectivity. These alternative methods are easily scalable for the detection of low 
abundance membrane proteins and the relative quantity of tagged peptides tagged 
is maintained through the selective enrichment steps. 



In the application of the methods of this invention to cell surface proteins, 
once the tagged proteins are fragmented, the tagged peptides behave no differently 
from the peptides generated from more soluble samples. 



Synthesis of affinity tagged protein reactive reagents that are selective for 
certain proteins groups 

(Cnthetic routes exemplary affinity tagged reagents suitable for use in the 

methods of mis invention are provided in Schemes 2-3 where well-known synthetic 
techniques are employed in synthesis of the non-deuterated and deuterated 
reagents. . 

Biotinyl-iodoacetylamidy]^4,7,10 trioxatridecanediamine 4 (Scheme 3) 
consists of a biotin group, a chemic^Hy inert spacer of capable of being isotopically 
labeled with stable isotopes and a iodoabetoamidyl group, respectively. The biotin 
group is used for affinity enrichment of peptiafe^derivatized with the reagent, the 
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ethylene glycol linker is differentially isotopically labeled for mass spectral analysis 
ano4tie iodoacetamidyl group provides specificity of the reagent for sulfhydryl- 
contairoqg peptides. The reagent an be synthesized in an all hydrogen form 
(isotopicaliyjight form) with and with 1-20, and preferably 4-8 deuterium atoms in the 
linker (isotopically heavy forms). 

Analysis of Velocities of Multiple Enzymes in Cell Lysates 

Monitoring enzyme functions by biochemical assays is an essential diagnostic 
tool that employs a multitude of analytical techniques including spectrophotometry, 
fluorometric, and radiometric detection of products. However, current methods are 
difficult to use for assaying several enzymes simultaneously in a single sample. 
Mass spectrometry for quantification of a collection of metabolites in biological fluids 
has emerged as a powerful approach for the analysis of birth defects (Morris et al., 
1994), but this analytical technique has not been developed for the direct analysis 
of rates of individual enzymatic steps. The analytical method described herein for 
monitoring and quantification of enzymatic activities in cell homogenates and other 
biological samples permits simultaneous (multiplex) monitoring of multiple reactions, 
and can be readily automated. 



feature of the method of this invention as applied to enzyme assays is the 
use of elebfrospray ionization mass spectrometry (ESI-MS) (Cole et al., 1997) for the 
simultaneous^tetection of enzymatic products and chemically identical internal 
standards, whiclvare distinguished by stable isotope (deuterium) labeling. A second 
feature is the use of affinity tagged reagents containing an enzyme substrate which 
when combined with affinftyDurification provide for facile capture of enzymatic 
products from crude biologicaNjuids. The affinity tagged reagents are designed to 
contain a target substrate for an enzyme of interest that is covalently attached to an 
affinity tag via a linker. Action of the enzyme of interest on the substrate conjugate 
causes cleavage or other modification tn^t changes its molecular mass (Scheme 4). 
The change of mass is detected by ESI-MsNThe linker and affinity tag used 
preferably facilitate ionization by ESI, block acuon of other enzymes in the biological 
fluid, and allow highly selective capture from the complex matrix for facile 
purification. 
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\&n example of this approach is the design and synthesis of affinity tagged 
enzyme substrate reagents 1 and 2 (Scheme 5) to simultaneously assay lysosomal 
(3-galactosioase and N-acetyl-a-D-glucosaminidase, respectively. Deficiency of the 
former enzymkresults in one of the lysosomal storage diseases, GM r 
gangliosidosis, ^condition that occurs in the population with a frequency of about 1 
in 50,000 and leadXlo early death of affected children.. Deficiency of N-acetyl-R-D- 
glucosaminidase results in the rare lysosomal storage disorder Sanfilippo syndrome 
type B. This example has been described in Gerber et al., 1999, which is 
incorporated by reference^f^rein in its entirety. 

Conjugates 1 and 2 consist of biotin as an affinity tag, which is coupled to 
sarcosine. Biotin allows highly specific capture of the substrate conjugate through 
non-covalent binding to streptavidin immobilized on agarose beads (Bayer et al., 
1990). Sarcosine provides an N-methylated amide linkage to biotin to block the 
enzyme biotinidase, which is often present in the cellular fluids and could cause 
cleavage of the conjugate molecule during the assay (Wilbur et al., 1997). In 
addition, it was found that biotinyl-sarcosine conjugates can be displaced from 
streptavidin by addition of biotin. The N-biotinylsarcosine block is linked to a 
polyether diamine, the length of which can be varied to avoid mass/charge overlaps 
of products and internal standards. The linker also allows facile introduction of 
multiple deuterium atoms (i.e., 8 deuteriums in 5 and 4 in 6, Scheme 5) to permit the 
synthesis of internal standards. The d8-linker was made by reacting 
DOCH 2 CH 2 OCH 2 CH 2 OD with CD 2 = CDCN in benzene with catalytic NaOD 
(Ashikaga, K., et al., 1988) and the resulting dinitrile was reduced to the diamine with 
Ra-Ni. The d4-linker was made in the same way using ethylene glycol and CD2 = 
dCDCN in CH 3 CN and catalytic NaOH. 

"Trk^ddition, the linker is hydrophilic to ensure good water solubility of the 
substrate cortfugate, and it has basic groups which are efficiently protonated by ESI 
and thus ensure sfeqsitive detection by mass spectrometry. The target carbohydrate 
substrates are attachecHo the polyether linker by a (3-alanine unit (Scheme 5). The 
enzymatic product conjugates and 4 are also shown Scheme 5. Conjugates 1 and 
2 were prepared as shown in Scnfen^e 5. All reagents were purified to homogeneity 
by reverse-phase HPLC and characterised by high-field 1 H NMR and ESI-MS. The 
substrate was linked to the diamine spacerl>yMichael addition of the latter onto the 
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Vacryloylamidophenyl glycoside, (Romanowska et al. f 1994) and the intermediate 
wfcas coupled with the tetrafluorophenyl ester of N-biotinylsarcosine (Wilbur et al., 
199^. 

The ESI-MS assay of (3-galactosidase and N-acetyl-R-D-glucosaminidase is 
based on enzymatic cleavage of the glycosidic bond to release monosaccharide and 
conjugates 3 and 4 (mass differences are 162 and 203 Da, respectively). In a typical 
procedure, 0.2 mM 1 and 0.3 mM 2 were incubated with sonicated cultured 
fibroblasts from individual patients with p-galactosidase deficiency and with 
fibroblasts cultured from unaffected people. After incubation, labeled internal 
standards 5 and 6 were added, and the biotinylated components were captured on 
streptavidin-agarose beads. Quantitative strepavidin capture efficiency from a cell 
homogenate was observed with model reagents. After purification by multiple 
washings to remove nonspecifically bound components, the biotinylated products 
were released by free biotin, and the eluant was analyzed by ESI-MS. About 85% 
release of the biotinylated products was observed after incubation with excess biotin 
for 90 min. A blank was obtained by quenching the assay with all components 
present at time zero. 

A typical procedure, cell protein (75 pg) in 15 pL of water was added to 15 pL 
buffer (0.1 M Na citrate, pH 4.25) containing 2 (0.3 mM) and 1 (0.2 mM, added 5 h 
after addition of cell protein). After incubation for 5.5 h at 37 °C, the reaction was 
quenched by addition of 200 pL of 0.2 M glycine carbonate buffer, pH 10.3, and 5 
and 6 (1 nmol each) were added. After centrifugation to remove cell debris, the 
supernatant was loaded onto a bed of streptavidin-agarose (7 nmol biotin binding 
capacity, Pierce) in a small filtration device (micro BioSpin, Bio-Rad). After 5 min, 
filtration was effected by centrifugation, and the gel bed was washed with 0.1% 
Triton X-100 ( about 1 min incubation, then spin) and then six times with purified 
water (Milli-Q, Millipore). Elution was carried out in 25 pL of 50% methanol 
containing 56 nmol of free biotin (1-10 h incubation, then spin). Filtrate was diluted 
4-fold with 50% methanol/water, and 1 pL was analyzed by ESI-MS. 

The ESI-MS spectrum of the blank (Figure 6A)is remarkably simple, showing 
peaks of the (M + H) + ions from reagents 1 and 2 (m/z 843 and 840), internal 
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standards 5 and 6 (m/z 689 and 641), and trace amounts of products 3 and 4 (m/z 
681 and 637). Ions due to clusters of biotin also appear in the spectrum but did not 
interfere with the analysis. The presence of nondeuterated products in the blank 
may be due to nonenzymatic substrate reagent hydrolysis during sample work up or 
to collision-induced dissociation of the substrate ion in the gas phase. A MS/MS 
spectrum of the (conjugate 1 + H) + ion at m/z 843 gave a prominent fragment of 
(conjugate 3 + H) + at m/z 681 (spectrum not shown). The ESI-MS spectrum of a 
sample incubated with cell homogenate from a healthy individual clearly shows the 
0-galactosidase product at m/z 681 and the N-acetyl-a-D-glucosaminidase product 
at m/z 637 (Figure 6B). Triplicate enzymatic reactions using cells from a healthy 
patient yielded a 0-galactosidase specific activity of 51 ±3 nmol/h/(mg cell protein) 
and an N-acetyl-a-D-glucosaminidase specific activity of 1 .4 ± 0.3 nmol/h/mg. Time 
course studies confirmed that the initial reaction velocities were being measured. 
Values obtained with cells from six healthy individuals ranged from 36 ± 4 to 68 ± 3 
nmol/h/mg for p-galactosidase and 0.9 ± 0.05 to 2.3 ± 0.4 nmol/h/mg for N-acetyl-a- 
D-glucosaminidase. In contrast, very little enzymatic product above the blank level 
(0.9 ± 0.9 and 0.8 ± 0.6 nmol/h/mg) was observed when cells from two patients with 
p-galactosidase deficiency were used, whereas N-acetyl-a-D-glucosaminidase 
activity is clearly detected (Figure 6C). These spectra were obtained with 0.75 pg of 
cell protein, corresponding to ~ 1000 fibroblasts. Thus the ESI-MS method has very 
high sensitivity for biomedical applications. 

ESI-MS was carried out on a Finnigan LCQ ion trap instrument. Data were 
collected in full scan mode from m/z 625 to 875 by direct infusion at 1 .5 pL/min. 
Specific activities were obtained from the ratio of product to internal standard ion 
peak areas (averaged over 30 scans). 



ESI-MS can oe^roadly applied. The multiplex technique can be expanded to assay 
dozens or more enzymes simultaneously in a single reaction, obviating the need for 
multiple assays to assistsn confirming diagnoses of rare disorders. The method can 
be used to measure severahegizymes simultaneously when evaluating the rate of 
chemical flux through a specific ote^hemical pathway or for monitoring biochemical 
signaling pathways. The affinity tag-capture reagent method for isolation of affinity 
tagged reaction products and substrates from complex mixtures is technically simple 




approach described for assaying enzymes using substrate reagents and 



smd can be readily automated, particular when biotin-strepavidin is employed. 
Because of the high sensitivity of the ESI-MS detection employed, which requires 
only^sub-microgram quantities of the substrate reagents per assay, the synthesis of 
several hundred substrate reagents on a low-gram scale becomes practical and 
economical. Since most enzyme active sites are exposed to solvent, it is possible to 
attach an Affinity tagged linker to most enzyme substrates while preserving 
enzymatic activity. Scheme 6 provides the structures of several additional enzyme 
substrates, suitable for use in this method, indicating by arrows allowable positions 
for tag attachment sites. Allowable tag sites for additional enzyme substrates can 
be determined by review of X-ray crystal structures of enzyme-substrate or enzyme- 
substrate analog structures. Using a standard computer graphics program , 
available X-ray data atod by attaching an extended chain butyl group (as a model for 
the affinity tagged linker^ to potential tag attachment sites, suitable attachment sites 
that show there are no enfcyme-atoms in van der Waals overlap with the model tag 
can be predicted. \ 

Analogous methods to those described above can be applied to the analysis 
of enzymes associated with other Sanfillipo Syndromes (A, C and D). SFA is 
associated with heparin sulfamidase, SFC is associated with acetyl-CoA-alpha- 
glucosaminide N-acetyltransferase and SFD is associated with N-acetylglucosamine 
6-sulfatase. Exemplary affinity tagged enzyme substrate reagents useful in the 
analysis of these enzymes and the diagnosis of these disorders are provided below. 
The methods can also be applied of the diagnosis of Niemann-Pick Type A and B 
disease by assaying for acid sphingomyelinase and to the diagnosis of Krabbe 
disease by assaying for galactocerebroside beta-galacatosidase. These enzymes 
are currently assayed employing fluorophore-derivatized reagents as indicated in 
Scheme 7. Enzyme substrate reagents for assay of these enzymes in the methods 
herein can be readily prepared by replacement of the fluorophore with an A-L group 
herein. This approach to preparation of affinity tagged enzyme substrates is 
generally applicable to any known fluorophore -derivatized enzyme substrate or 
substrate analog. 

Table 4 provides exemplary enzymes that are associates with certain birth 
defects or oisease states. These enzymes can be assayed by the methods 
described herein. 
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Assaying Enzymatic Pathways for Carbohydrate-Deficient Glycoprotein 
Syndromes (CDGS) 

The methods and reagents of this invention can be employed to quantify the 

velocities of multiple enzymes pertinent to diagnosis of CDGS diseases. 

CDGS Type la and lb are caused by the deficiency or absence of the 
enzymes phosphomannoisomerase (PMIb) (Type lb) and phosphomannomutase 
(PMM2) (Type la) which are part of a multistep pathway (Scheme 8) for conversion 
of glucose to mannose-1 -phosphate (Freeze, 1998). The monosaccharide 
substrates involved in the pathway are fructose-6-phosphate, mannose-6- 
phosphate, and mannose-1 -phosphate. These monosaccharides can be somewhat 
difficult to convert to substrate conjugates because it is not a priori clear which atom 
on the sugar should be conjugated with the linker without impairing enzyme activity. 
PMIb and PMM2 can, however, be assayed indirectly. Mammalian cell microsomes 
contain dolichol-P-mannose synthase which catalyzes the reaction of dolichol- 
phosphate with GDP-mannose to form dolichol-P-mannose and GDP (Scheme 8, 
Chapman et al. 1980). This synthase can be assayed using the methods of this 
invention, specifically with a biotin-linker substrate. Microbial PMM and the enzyme 
which makes GDP-mannose from GTP and mannose-1 -P, GDP-mannose 
pyrophosphorylase, are readily purified from bacteria and yeast (Glaser, 1966, 
Preiss, 1966), and these enzymes can be supplied exogenously to the enzyme 
assay. If PMIb activity is deficient, little or no mannose-6-P will be made when the 
reaction sequence is started by addition of fructose-6-P. Without mannose-6-P, 
mannose-1 -P and GDP-mannose will not be made, and thus no conjugated-dolichol- 
P-mannose will be detected by ESI-MS. Exogenous GTP is supplied as a 
requirement for the GDP-mannose pyrophosphorylase step, and ATP is omitted so 
that mannose-6-P cannot be made from mannose. To assay PMM2, the reaction 
sequence is initiated with mannose-6-P, and PMM2 deficiency results in the failure 
to make conjugated-dolichol-P-mannose. 

The carrier dolichol is a -60- to 105-carbon isoprenoid. Evidence is 
accumulating that many enzymes that operate on carbohydrates attached to dolichol 
chains are tolerant to significant shortening of the dolichol chain; even 10- and 15- 
carbon dolichols are tolerated (Rush and Wachter, 1995). It appears that such 
enzymes act on the water-soluble carbohydrate portion of the dolichol conjugate and 
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thus have little or no requirement to bind the dolichol anchor. Based on this, an 
affinity labeled substrate for the direct assay of dolichol-P-mannose synthase and 
the indirect assay of PMIb and PMM2 is prepared by attaching an affinity labeled 
linker to the non-polar end of a short dolichol, such as the 10-carbon dolichol analog 
citronellol. 

\The synthesis of a biotinylated dolichol 10 -substrate conjugate containing a 
sarcosinyl linker (B-S-Do^O-P) is shown in Scheme 9. Protected citronellol R = t- 
BuSiMe 2 )\s regioselectively oxidized at the terminal alyllic methyl group (McMurry 
and Kocovsky, 1984), and the allylic alcohol is coupled with biotinylsarcosine active 
ester R = CH^. The citronellol 1 -hydroxy group is subsequently deprotected and 
phosphoiylatecWith POCI 3 (Rush and Wachter, 1995). In a parallel synthesis, d 5 - 
sarcosine, CD 3 NhJCD 2 COOH, is used to prepare the isotopically labeled (heavy) 
reagent for use as\n internal standard. d 5 - Sarcosine is readily prepares form 
commercially availably materials (BrCD 2 COOD and CD 3 NH 2 ) using standard 
synthetic techniques. \ 

The deuterated internal standard, B-d 5 -S-Dol 10 -P-Mannose, is synthesized 
enzymatically by incubating hen oviduct microsomes with GDP-mannose and the 
synthetic B-d 5 -S-Dol 10 -P substrate conjugate (Rush and Waechler, 1995). An added 
advantage of the B-S-conjugate is that it allows for a facile affinity purification of the 
microsomal mannosylated product by specific capture on agarose-streptavidin 
beads followed by elution with free biotin. 



This method employing affinity tagged short dolichol analogues is generally 
applicable for assaying other enzymes that operated on dolichol anchored 
carbohydrates. Such an approach is useful for the subsequent identification of 
enzyme deficiencies present in other types of CDGS that have not been yet 
identified. 

N SDGS Type II results from defective GlcNAc transferase II (GlcNAc-T II) 
which transfefsGlcNAc from UDP-GlcNAc to the 2-position of a mannose residue 
in the intermediatebrqnched oligosaccharide (the Core Region) in the process of 
building up the disialo-biarfteQnary chain (Scheme 10) (Schachter, 1986; 
Brockhausen et al, 1989). GlcN^tstransferase II is one of the six known enzymes 
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at mediate highly regiospecific glycosylation of the mannose residues in the Core 
Region. The Core Region is anchored at the reducing end to chitobiosylasparagine, 
where the asparagine residue is part of the peptide chain of the glycosylated protein. 
The latter structure unit in the substrate can be replaced by a hydrophobic chain 
5 without loss orenzyme activity (Kaur et al., 1991 ). Thus, the substrate conjugate for 
CDGS Type II is ab^embled by linking a affinity-labeled linker group to the reducing 
end to chitobiosylasparagine. However, the latter structure unit in the substrate can 
be replaced by a hydrophobic chain without loss of enzyme activity (Kaur et al., 
1991). For example, commerci&lW available a-D-manno- 
10 pyranosylphenylisothiocyanate carK>e coupled to a biotin-labeled linker and the 5,6- 
hydroxyls are selectively protected as iHqstrated in Scheme 1 1 (Paulsen and 
Meinjohanns, 1992). Coupling of the equatorial 3-OH with per-O-acetyImannosyl-1- 
trichloroacetamidate (Paulsen et al, 1993) will provide a disaccharide conjugate 
(Scheme 12). If a minor amount of coupling occur&sat the axial 2-OH group the 
products can be separated by HPLC. After deprotectioK the primary 6-OH is 
coupled with a second equivalent of per-O-acetylmannosylH^trichloroacetamidate to 
yield the Core Region conjugate. Deprotection of the O-acetyl groups yields the 
substrate conjugate for GlcNAc transferase I which can be converted to the GlcNAc- 
T II substrate by enzymatic glycosyl transfer using a Triton X-100 rao^it liver extract, 
a reaction that has been carried out on a preparative scale (Kaur et al., >981). 
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The synthesis of the deuterium labeled derivative needed for the internal 
standard is performed in parallel by using a labeled PEG-diamine building block 
(Gerber et al., 1999). The biotinylated trisaccharide is converted to the 
tetrasaccharide (product of GlcNAc-T II) by incubation with UDP-GlcNAc and 
25 transferase II ( (Kaur and Hindsgaul, 1 991 ;Tan et al., 1 996) and utilizing the B-S 
handle for affinity purification of the enzymatic products. 

CDGS Type V 

The lipid-linked oligosaccharide (LLO) that is transferred to the Asn residue of 
the glycosylated protein is composed of 2 GlcNAc, 9 mannoses, and 3 glucoses. It 
30 has recently been shown that microsomes from CDGS type V patients are greatly 
deficient in the enzyme that transfers one or more glucose residues during LLO 
biosynthesis (Korneretal, 1998). Since the transferase that attaches the 
carbohydrate unit of LLO to the Asn residue discriminates against the glucose- 
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deficient LLO, CDGS Type V patients have fewer numbers of carbohydrate units 
attached to glycoproteins, such as transferrin (Korner et al., 1998). However, the few 
carbohydrate units that are present are full-length, demonstrating that residual 
glucosyl transfer occurs in type V CDGS patients (Korner et al., 1998). Thus, 
quantification of the rate of Asn glycosylation by ESI-MS would constitute a viable 
assay of CDGS Type V syndrome. 

Synthetic peptides with 3-7 amino acid residues containing the Asn-Xaa- 
Ser/Thr sequence have been shown to be good substrates for glycosylation (Ronin 
et al., 1 981 ). The strategy for the ESI-MS assay of the oligosaccharide transferase 
relies on a B-S conjugate of an appropriate peptide containing the Asn-Xaa-Ser/Thr 
sequence (Scheme 13). A heptapeptide, NH 2 -Tyr-Gln-Ser-Asn-Ser-Thr-Met-NH 2 
(SEQ ID NO:1) has shown high activity in a previous study (Ronin et al., 1981). The 
peptide is readily available by standard peptide synthesis using an in-house 
automatic synthesizer. The heptapeptide and its glycoconjugates can be ionized by 
ESI to provide stable singly-charged ions. Coupling of BS-tetrafluorophenyl ester 
with NH 2 -Tyr-Gln-Ser-Asn-Ser-Thr-Met-NH 2 will directly yield the substrate for the 
transferase. Several products are expected from the enzymatic glycosylation and 
subsequent modifications of the oligosaccharide antenna. The products can be 
prepared enzymatically by incubating thyroid rough microsomes with BS-Tyr-Gln- 
Ser-Asn-Ser-Thr-Met-NH 2 and Dol-P-Glu (Ronin et al., 1981a), followed by affinity 
purification of the biotinylated products. Product distribution due to different degrees 
of glycosylation can be monitored by ESI-MS, and the major components can be 
purified by HPLC. An analogous procedure using a B-N(CD 3 )CD 2 CO- conjugate is 
used to prepare deuterated internal standards. 

The molecular masses of the ionized substrate conjugates for the set of 
enzymes assayed for CDGS la, lb, II, and V syndromes, as well as products and 
internal standards are compiled in Table 5, which shows that no isobaric overlaps 
among the (M + H) + species occur. The close spacing between the (M + Na) + ion 
from the Type la,b product and the (M + H) + ion of the demannosyiated B-(N-C 2 D 5 )- 
2,2-D 2 -Gly-Dol 10 -P internal standard can be readily avoided by adjusting the ESI-MS 
conditions by addition of Na + ions to generate the gas phase ions as Na-adducts. 
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All three of the targeted enzymes can be analyzed simultaneously in a single 
biological sample, such as a cell lysate. The PMM2 and PMIb cannot be assayed 
simultaneously because they require the addition of different exogenous substrates. 
Nevertheless, two assays using identical ESI-MS techniques can be used for 
diagnosing the various CDGS types instead of relying on a battery of different 
methods. 

Clinical Enzymology Assays 

A fibroblast cell pellet is thawed on ice. Sufficient 0.9% NaCI is added to give 
a final protein concentration in the lysate of ~- 5mg/ mL (typically 100 mcL), and 
the cell pellet is sonicated in ice water 5 times for 2 seconds each at moderate 
power. Total protein is determined spectrophotometrically using the BCA reagent 
(BCA Protein Assay kit, Pierce). 

The total enzyme reaction volume is 20 - 30 mcL. The substrate stock 
solutions aretri^intained at concentrations of 3mM (SFB) and 2mM (GM1). These 
concentrations werfesjieasured by 1H-NMR signal integration versus an internal 
standard (formamide profcn^of DMF). Final concentration of substrates is 0.3 and 
0.2 mM, respectively. A volumebf^reaction buffer (e.g. 200mM sodium citrate, pH 
4.5) equal to the difference of the substrate addition (2-3 mcL) plus sufficient cell 
sample volume to equal 50 u 75mcg total pTotein from 20-30 mcL is added to a 0.5 
mL Eppendorf tube, followed by substrate. The^ajriple is cooled on ice, and patient 
cell sample is added. The reaction is initiated by incubation at 37 °C. 

For SFB: The reaction is allowed to proceed for 4.5 - 6 hours, after which 
GM1 substrate can be added or the reaction can be quenched with 100 mcL of 
200mM glycine-carbonate buffer, pH 10.5. 

For GM1: The reaction is allowed to proceed for 0.5 hours. Quenching is as 
for SFB. 
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After quenching, the samples are placed on ice. Internal standards are added 
(1nmol each, i.e. 50 mcL of a 0.02 mM solution). The samples are microfuged at 
-15,000 rpm for 2 min at room temperature to pellet cell debris. Streptavidin- 
Agarose beads (Immunopure immobilized streptavidin, Pierce) are placed in a micro 
bio-spin chromatography column (Bio-Rad). Sufficient beads are added to give a 
total biotin binding capacity of 5 nmol (typical binding capacity 100pmol per mcL of 
beads as determined by Pierce). The sample supernatant is transferred to the 
bio-spin tube and allowed to bind for 10 minutes at room temperature. The sample is 
spun at -3,000 rpm to remove excess supernatant, then washed once with 0.01% 
Triton X-100 and at least five times with purified water, spinning the tube in-between 
to remove solution. For each wash, sufficient wash solution is added to fill the 
bio-spin tube. 



The purified beads are then treated with 30 mcL purified water, followed by 
10 mcL of a 4mM biotin solution. The tubes are capped at the bottom to prevent 
leakage and allowed to incubate at 2-8°C for 2-12 hours. The samples are spun at 
-3,000 rpm to elute the sample into a clean Eppendorf tube. 

The sample is then diluted with 60 mcL of 50% methanol/water and infused 
into the ion-trap mass spectrometer. The ESI-MS spectrum is tuned to reduce 
non-specific cleavage of the samples by first analyzing a blank sample (cell lysate 
added after reaction quench). The infused sample is analyzed by ion chromatogram 
integration of a 1amu-wide window about the (M + H+) + ions of product and internal 
standard. 



Results are reported in nmol product formed/hour of incubation/milligrams 
total protein in reaction mixtures. 



Clinical Analysis of Patient Samples for GM1 and SFB 

Patient skin fibroblasts were obtained as frozen pellets, and stored at Q20°C 
until use. Two GM1 affected samples and six normal controls were analyzed. 
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50 mcL of 0.9% NaCI was added to each patient cell pellet. The pellets were 
lysed by sonication in ice water 5x for 2 seconds each at moderate sonication 
power, chilling the microtip in ice water in between sonications. 

Samples were quantitated by BCA (Pierce) assay as follows: 
Reagent A and B were mixed in 50:1 ratio as described. A protein standard 
curve was prepared using bovine serum albumin as a standard at concentrations of 
2, 1, 0.5, 0.2, and 0.05 mg/ mL. A portion of the patient sonicates were diluted 1:15 
in water, and 5 mcL of each diluted patient sample and standard curve point was 
added to separate glass culture tubes containing 200 mcL water, in duplicate. The 
samples were then diluted with 1 mL of the mixed BCA reagent, vortexed to mix, and 
incubated at 37 °C for 60 minutes. The samples were allowed to cool to room 
temperature, and analyzed against a blank containing only 200 mcL water. The 
samples were analyzed by monitoring absorbance at 562nm in polystyrene cuvettes. 
Average patient absorbance values were blank corrected and compared to 
standards via linear regression. 

The patient protein concentrations were determined to be: 

1. (Affected) 12.2mg/ mL, 2. (Normal) 10.8mg/ mL, 3. (Normal) 11.9mg/ mL, 4. 

(Normal) 12.1 mg/ mL, 5. (Normal) 10.3mg/ mL, 6. (Normal) 7.79mg/ mL, 7. (Normal) 

15.7mg/ mL, 8. (Affected) 11.4mg/ mL 

Incubations were performed in a total of 30 mcL of total volume. The amount 
of reaction buffer (200mM sodium citrate, pH 4.25) added to blank Eppendorf tubes 
was the difference of the substrate volume (3 mcL of each substrate stock solution, 
2mM for GM1 and 3mM for SFB, for a total of 6 mcL) plus the volume of cell lysate 
required to equal 75mcg total protein, from 30 mcL. For example, patient 1. 
incubation mixture initially contained 3 mcL of SFB substrate solution, 6.14 mcL 
patient cell lysate, and 17.86 mcL reaction buffer. The GM1 substrate was added 
later in the incubation (see below). 
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Each patient sample was analyzed in triplicate. The reaction mixtures were 
kept on ice during preparation, and the reaction was initiated by transfer to a 37°C 
water bath. 5.5 hr into the incubation, 3 mcL GM1 substrate was added to each 
reaction, and after an additional 0.5 hours the reactions were placed on ice and 
quenched with 200 mcL of a 200mM glycine-carbonate buffer, pH 10.25. 

The purification and analysis procedures are as described in Clinical 
Enzymology Assay (Typical). 

The resultant enzyme activities, as an average standard deviation nmol 
product/hour incubation/mg total protein: 
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0.6 


0.70 


0.20 



42 



The following synthetic methods refer to Schemes 14-23. 

Synthesis for GM1 -gangliosidosis (beta-D-galactosidase deficiency) 

V 2,3,5,6-Tetrafluorophenyl trifluoroacetate (1) 25g (0.15mol) 

2,\5,6-tetrafluorophenol, 35 mL (0.2mol) trifluoroacetic anhydride and 0.5 ml_ boron 
trifluoride etherate were refluxed for 18 hours under argon atmosphere. 
Trifluoroacetic anhydride and trifluoroacetic acid were removed by distillation at 
room teiTHDerature. The trifluoroacetic anhydride fraction was returned to the mixture, 
and the reaction was refluxed for 24 hours. This was repeated twice. After final 
distillation at\oom temperature, the desired product 1 was distilled at reduced 
pressure (62°Q(45mmHg) to produce a colorless liquid (30g, 82%). 1H-NMR. 
(Gamper, H. B., 1v993). Biotin-2,3,5,6-tetrafiuorophenyl ester (2) A 2.5g (10.3mmol) 
quantity of d-biotin rn 20 mL anhydrous DMF under argon atmosphere was warmed 
to 60°C with stirring t\effect dissolution. 1.7 mL (12.5mmol) triethylamine was 
added, followed by 3.4g\{12.5mmol) 1 . The mixture was stirred for 2 hours, after 
which the solvent was removed by rotary evaporation. The resultant semi-solid was 
triturated with 15 mL ether twice to produce a white solid (2.6g, 65%). 1 H-NMR. 
(Wilbur, D. S., et al., 1997). N-r^ethylglycylbiotinamide-methyl ester (3) A 2.5g 
(6.4mmol) quantity of biotin tetraffojorophenyl ester in 30 mL anhydrous DMF under 
argon atmosphere was added to a mixture of 1 .1g (7.7mmol) N-methylglycine methyl 
ester hydrochloride dissolved in 10 mL arihydrous DMF and 1.25 mL (9.0 mmol) 
triethylamine. The reaction mixture was stihred at room temperature for 2 hours, then 
the solvent was removed by rotary evaporation. The residue was extracted with 
chloroform (2x100 mL), washed with water (2x2XnriL), and dried with anhydrous 
sodium sulfate. The solvent was removed under vacuum to yield 2.1g (98 %) of 
methyl ester of N-methylglycine biotinamide as an oilNjI H-NMR. (Wilbur, D. S., et al., 
1997). \ 

4. N-methylglycylbiotinamide acid (4) N-Methylglycylbiotinamide methyl ester 
was hydrolyzed in a mixture of 31 mL MeOH and 10 mL of 1 N 
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NaOH at room temperature with stirring for 1 hour. The mixture was diluted with 50 
mL 50% MeOH/water and neutralized with cation exchange resin, hydrogen form 
(AG MP-50, BioRad). The solution was filtered, the resin washed (3x50 mL) with 
50% MeOH/water, and the solvents removed by rotary evaporation to yield 1 .6g 
(90%) of N-methylglycylbiotinamide acid as an off-white solid. 1H-NMR. (Wilbur, D. 
S., etal., 1997). 

5. p-Acrylamidophenyl-p-D-galactopyranoside (5) 40mg (0.15mmol) 

p-aminophenyl p-D-galactopyranoside was added to 25 mL methanol and 200 mcL 
triethylamine with stirring. The solution was chilled in an ice bath. 53.3mg (0.6mmol) 
acryloyl chloride was dissolved in 5 mL dry methylene chloride and added dropwise 
to the stirred solution over 5 minutes. The reaction was allowed to return to room 
temperature, followed by 2 hours of stirring. The solution was then treated with 
successive anion and cation exchange resins (AG MP-1 and AG MP-50, 
respectively, BioRad) until neutral pH was obtained with moist pH paper. Solvent 
was removed by rotary evaporation to yield a solid (43mg, 90%). 1H-NMR. 
(Romanowska, A., et al., 1994). Michael addition product of 
4,7,10-trioxa-1 ,13-tridecanediamine and 5 (6) 20mg (0.07mmol) 5 was added to a 
stirred solution of 80mg (0.35mmol) 4,7,1 0-trioxa-1 ,13-tridecanediamine in 5 mL 
0.2M sodium carbonate, pH 10.5 at 37°C. The reaction was allowed to proceed for 3 
days, after which the solution was neutralized with dilute trifluoroacetic acid and 
purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile 
phase: H 2 0 (0.08%TFA)/ ACN (0.08%TFA)) to give 7.3 mg of product. 
(Romanowska, A., et al., 1994). 

7. GMlsubstrate conjugate of 4 and 6 (7) A 2.5mg (7.4mcmol) quantity of 4 was 
dissolved irM^mL anhydrous DMF with stirring, under argon atmosphere. 5 mcL 
triethylamine was add^d, followed by 2.3mg (8.8mcmol) 1 . The formation of active 
ester was monitored by silfca>C!XC (5:1 CHCI 3 /CH 3 OH, Rf 0.5, UV) by briefly drying 
the spotted TLC plate with a strearh-o^air. After 25 minutes, the mixture was added 
to 3.2mg (5.9mcmol) 6 in 1 mL anhydrous'BJ^F. After 2 hours, the solvent was 
removed by vacuum centrifugation and the final^roduct was purified by 
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reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H20 
(0.08%TFA)/ ACN (0.08%TFA)). Yield 4.6 mg. (For analogous chemistry, see 
Wilbur, D>^, etal., 1997). 

8. 1,2,10,1 1-octadeutero-3,6,9-trioxa-1,11-undecanedinitrile (8) 1g (9.4mmol) of 
diethylene glycol was dissolved in 2 ml_ D20 in a 10 mL round bottom flask under 
argon atmosphere. The D 2 0 was removed by rotary evaporation and the process 
was repeated 4 times. The d-2 diethylene glycol was additioned with 25 mL dry 
benzene, followed by 1.6 g (28.2mmol) d-3 acrylonitrile with stirring under argon 
atmosphere. After 12h, the solvent was removed under reduced pressure and the 
resultant semisolid was extracted with chloroform (2x5 mL). The solvent was 
removed by rotary evaporation to yield 1 .85g (89%) product. (Ashikaga, K., et al., 
1988). 

9. 2,3,1 1,1 2-octadeutero-4,7,10-trioxa-1 1 13-tridecanediamine (9) Raney 
nickel (Aldrich) was washed five times with anhydrous methanol by inversion and 
decantation. 50mg of the washed catalyst was placed in 20 mL anhydrous methanol, 
followed by 1g (4.6mmol) 8 in a 50 mL screw-cap vial fitted with a Teflon-lined 
rubber septum. The vial headspace was flushed for a few min with H 2 gas via an 
18-gauge needle piercing the septum. The cap was screwed on tightly and the 
entire assembly was charged to 40psi H 2 and placed in a hot water bath (80°C) for 4 
hours, after which the solid catalyst was removed by filtration and the methanol 
evaporated. The final product was purified by reverse-phase HPLC (Vydac C-18 
prep-scale column, 6 mL/min. Mobile phase: H 2 0 (0.08%TFA)/ ACN (0.08%TFA)). 
Yield 180 mg. (Ashikaga, K., et al., 1988). 

10. Deuterated analog of 6 (10) 25mg (0.09mmol) 5 was added to a stirred 
solution of 90mg (0.4mmol) 9 in 5 mL 0.2M sodium carbonate, pH 10.5 at 37°C. The 
reaction was allowed to proceed for 3 days, after which the solution was neutralized 
with dilute trifluoroacetic acid and purified by reverse-phase HPLC (Vydac C-18 
prep-scale column, 6 mL/min. Mobile phase: H 2 0 (0.08%TFA)/ ACN (0.08%TFA)). 
Yield 6 mg. 
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1 1 . Deuterated analog of 7 (1 1 ) A 3mg (8.4mcmol) quantity of 4 was dissolved 
in 0.7 ml_ anhydrous DMF with stirring, under argon atmosphere. 5 mcL 
triethylamine was added, followed by 2.4mg (8.9mcmol) 1 . The formation of active 
ester was monitored by silica TLC (5:1 CHCI 3 /CH 3 OH, Rf 0.5, UV) by briefly drying 
the spotted TLC plate with a stream of air. After 25 minutes, the mixture was added 
to 6mg (1 1 mcmol) 1 0 in 1 mL anhydrous DMF. After 2 hours, the solvent was 
removed by vacuum centrifugation and the final product was purified by 
reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 
(0.08%TFA)/ ACN (0.08%TFA)). Yield 1.8 mg. 

12. GM1 internal standard conjugate (12) 1.8mg 1 1 was added to 2 mL of 
100mM TrismJmM MgCI2, pH 7.3 buffer with stirring. 15 units recombinant 
p-D-galactosidas&i [Sigma) was added, and after 12 hours the mixture was purified 
by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: 
H 2 0 (0.08%TFA)/ACN (bsQ8%TFA)). Yield 1.5 mg. 

Polyether diamine linker synthesis (Second Generation) 

Synthesis is based on chemistry previously described (Kataky, R. et. al., 
1990), with minor modifications and an additional two steps. As an example, 
deviations from the established procedure as well as exact details for the additional 
steps are outlined below for the starting material diethylene glycol. 

1 ,1 1-Dicyano-3,6,9-trioxaundecane (13) To a stirred solution of 2% (w/v) 
sodium hydroxide (5 mL) and diethylene glycol (5.3g, 50mmol) was added 
acrylonitrile (7.95g, 150mmol). The mixture was stirred at room temperature 
overnight and additioned with 50 mL dichloromethane. The organic layer was 
washed 2x with brine and dried (MgSOJ. The solvent was removed by rotary 
evaporation. The oily residue was treated with 200 proof ethanol, and the solvent 
was removed by rotary evaporation. This was repeated 2x to remove excess 
unreacted acrylonitrile. The product was used without further purification 
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Diethyl 4,7,10-trioxatridecane-1 ,13-dioate (14) 2g (9.4mmol) 13, was 
dissolved in 5 mL ethanol. 1g cone, sulfuric acid was added slowly, over 5 minutes. 
The reaction was heated to reflux overnight. The reaction was extracted with 40 mL 
methylene chloride, washed once with 10 mL water and 3x with 10 mL dilute brine 
solution. The organic layer was dried (MgS0 4 ) and solvent was removed to yield an 
oil. The final product was purified by silica chromatography (methylene chloride/ethyl 
acetate). 

1 f 13-dihydroxy-4,7,10-trioxatridecane (15) Prepared exactly as described, 
using tetrahydrofuran as solvent. (1.7g, 5.5mmol 14, 50 mL distilled [CaH 2 ] THF, 
0.66g, 16.5mmol lithium aluminum hydride). Once addition was complete, excess 
LAH was quenched with ethanol, and the salts precipitated by dropwise addition of 
saturated sodium sulfate solution until a white precipitate formed. The solvent was 
removed, the precipitate washed 6x 30 mL with THF and the combined organic 
extracts were evaporated to yield an oil. Final product is purified by silica 
chromatography (first with methylene chloride then with ethyl acetate and finally with 
acetone). 

1,13-dichloro-4,7,10-trioxatridecane (analog using P2) (16) 1.1g, (4.9mmol) 
15 was added to 1.1 5g (14.6mmol) distilled pyridine in 30 mL dry benzene with 
stirring, followed by 1.8g (14.6mmol) thionyl chloride. The mixture was heated to 
reflux for 6 hours. After cooling in an ice bath, 5 mL 3M HCI was added with vigorous 
stirring. The organic layer was separated, washed 3x with a dilute brine solution, and 
dried (NaS0 4 ) to yield a yellowish oil. After washing and removal of solvent, the 
dichloride was used without further purification. 

Additional steps: 

1,13-dicyano-4,7,10-trioxatridecane) (17) To a stirred solution of 0.78g 
(15.5mmol) sodium cyanide in 4 mL dimethyl sulfoxide at 80°C was added 1g 
(3.9mmol) of 16. After 2 hours, the reaction was additioned with 10 mL of saturated 
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sodium chloride solution, 5 mL of water, and 50 ml_ ethyl acetate. The organic layer 
was washed 3x with a brine solution as before, after which the organic layer was 
dried (Na 2 S0 4 ) and the solvents removed. The final product was purified by silica 
chromatography (methlyene chloride/ethyl acetate). ESI-MS: predicted, 240.1; 
observed, 241.1 (M + H+) + . 

1,15-diamino-5,8,11-trioxapentadecane (18) A stirred solution of 50 mLdry 
THF containing 0.42g (10.4mmol) fresh LAH was heated to gentle reflux under 
argon for 15 minutes. 0.5g (2mmol) 17 in 15 mL dry THF was added dropwise over 
20 minutes, maintaining a gentle reflux. The unreacted LAH was quenched with 
ethanol, and the mixture was treated with dropwise addition of saturated sodium 
sulfate under efficient stirring until a white precipitate formed. The mixture was 
filtered, and the precipitate was washed 6x 30 mL with THF. The organic extracts 
were combined and the solvent was removed by rotary evaporation to yield an oil. 
ESI-MS: predicted, 248.1; observed, 249.1 (M + H+) + . 

Deuteration 

Deuterium has been incorporated into the diamine linker by reduction of 14 
and 17 using lithium aluminum deuteride (98% D) to achieve a d-8 deuterated 
diamine. No other aspects of the synthesis were changed for this procedure. These 
diols are used in the construction of the SFD conjugates as described later. 

Clinical Substrate Synthesis for Sanfilippo Syndrome, type B 
(N-a-D-glucosaminidase deficiency). 

13. p-Aminophenyl-a-D-N-acetylglucosamine (19) 20mg (0.07mmol) 
p-Nitrophenyl-a-D-N-acetylglucosamine (Sigma) was added to 5mg washed 
palladium catalyst on activated carbon in 3 mL methanol with stirring in a 5 mL 
septa-lined vial. The septum was pierced by a 16-gauge needle and the vial 
headspace was flushed with H 2 gas. H 2 gas was allowed to slowly bubble through 
the solution for 2 hours, after which the catalyst was removed by filtration over 
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diatomaceous earth (Celite). The solvent was removed by rotary evaporation to yield 
a semi-solid 18mg (90%). 

14. p-Acrylamidophenyl-a-D-N : acetylglucosamine (20) 10mg (0.03mmol) 19 was 
added to 15 mL methanol and 100 mcL triethylamine with stirring. The solution was 
chilled in an ice bath. 15mg (0.17mmol) acryloyl chloride was dissolved in 2 mL dry 
methylene chloride and added dropwise to the stirred solution over 5 minutes. The 
reaction was allowed to return to room temperature, followed by 2 hours of 
stirring. The solution was then treated with successive anion and cation exchange 
resins (AG MP-1 and AG MP-50, respectively, BioRad) until neutral pH was obtained 
with moist pH paper. Solvent was removed by rotary evaporation to yield a solid 

(1 1 mg, 95%). 1 H-NMR. Yield 1 1 mg. 

1 5. 3,6-dioxa-1 ,9-nonanedinitrile (21 ) 2g (0.032mol) ethylene glycol was 
added to 0.5 g dry potassium hydroxide in 30 mL dry benzene, followed by 5g 
(0.096mmol) acrylonitrile with stirring overnight at room temperature. The reaction 
was filtered and the solvent was removed by rotary evaporation to yield an oil. Final 
product was purified by silica chromatography (chloroform/methanol) to yield a 
colorless oil 3.2g (60%). 

16. 4,7-dioxa-1,10-decanediamine (22) Raney nickel (Aldrich) was washed 
five times with anhydrous methanol by inversion and decantation. 50mg of the 
washed catalyst was placed in 20 mL anhydrous methanol, followed by 1g (6mmol) 
21 in a 50 mL screw-cap vial fitted with a Teflon-lined rubber septum. The vial 
headspace was evacuated with H 2 gas via an 16-gauge needle piercing the septum. 
The cap was screwed on tightly and the entire assembly was charged to 40psi H 2 
and placed in a hot water bath (80°C) for 4 hours, after which the solid catalyst was 
removed by filtration and the methanol evaporated. The final product was purified by 
reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 
(0.08%TFA)/ACN (0.08%TFA)). 
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1 7. Michael addition product of 20 and 22 (23) 5mg (0.01 5mmol) 20 was added 
to a stirred solution of 13mg (0.06mmol) 22 in 5 ml_ 0.2M sodium carbonate, pH 10.5 
at 37°C. The reaction was allowed to proceed for 3 days, after which the solution 
was neutralized with dilute trifluoroacetic acid and purified by reverse-phase HPLC 

5 (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 (0.08%TFA)/ ACN 
(0.08%TFA)). Yield 6 mg. 

1 8. SFB substrate conjugate of 4 and 23 (24) A 4mg (0.01 3mmol) quantity 

of 4 was dissolved in 1.5 mL anhydrous DMF with stirring, under argon atmosphere. 

10 mcL dry triethylamine was added, followed by 4mg (0.015mmol) 1. The formation 
10 of active ester was monitored by silica TLC (5:1 CHCI 3 /CH 3 OH, Rf 0.5, UV) by briefly 
^ drying the spotted TLC plate with a stream of air. After 25 minutes, the mixture was 

ffl added to 6mg (0.012mmol) 23 in 1 mL anhydrous DMF. After 2 hours, the solvent 

gB was removed by vacuum centrifugation and the final product was purified by 

% reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 

*1 5 (0.08%TFA)/ ACN (0.08%TFA)). Yield 4.2 mg. 

J 19. 1 ,9-tetradeutero-3,6-dioxa-1 ,9-nonanedinitrile (25) 0.5g (8mmol) ethylene 

O glycol was added to 0.1 g dry potassium hydroxide in 20 mL acetonitrile, followed by 

tl 1 .4g (24mmol) d-3 acrylonitrile with stirring overnight at room temperature. The 

reaction was filtered and the solvent was removed by rotary evaporation to yield an 
20 oil. Final product was purified by silica chromatography (chloroform/methanol) to 

yield a colorless oil 0.9g (65%). 

20. 1 ,9-tetradeutero-3,6-dioxa-1 ,9-nonanediamine (26) Raney nickel (Aldrich) 
was washed five times with anhydrous methanol by inversion and decantation. 20 
mg of the washed catalyst was placed in 30 mL anhydrous methanol, followed by 
25 0.5g (3mmol) 25 in a 50 mL screw-cap vial fitted with a Teflon-lined rubber septum. 
The vial headspace was evacuated with H 2 gas via an 18-gauge needle piercing the 
septum. The cap was screwed on tightly and the entire assembly was charged to 
40psi H 2 and placed in a hot water bath (80°C) for 4 hours, after which the solid 
catalyst was removed by filtration and the methanol evaporated. The final product 
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was purified by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. 
Mobile phase: H 2 0 (0.08%TFA)/ ACN (0.08%TFA)). 

21 . Deuterated analog of 23 (27) 20mg (0.07mmol) p-acrylamidophenyl 
p-D-galactoside was added to a stirred solution of 90mg (0.4mmol) 26 in 5 ml_ 0.2M 

5 sodium carbonate, pH 10.5 at 37°C. The reaction was allowed to proceed for 3 days, 
after which the solution was neutralized with dilute trifluoroacetic acid and purified by 
reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 
(0.08%TFA)/ ACN (0.08%TFA)). Yield 2 mg. 

22. Deuterated analog of 24 (28) A 2mg (6.3mcmol) quantity of 4 was dissolved 
in 1 .5 mL anhydrous DMF with stirring, under argon atmosphere. 5 mcL 
triethylamine was added, followed by 2.1mg (7.6mcmol) 1 . The formation of active 
ester was monitored by silica TLC (5:1 CHCI 3 /CH 3 OH, Rf 0.5, UV) by briefly drying 
the spotted TLC plate with a stream of air. After 35 minutes, the mixture was added 
to 4mg (7mcmol) 27 in 1 mL anhydrous DMF. After 2 hours, the solvent was 
removed by vacuum centrifugation and the final product was purified by 
reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 
(0.08%TFA)/ ACN (0.08%TFA)). Yield 1.2 mg. 

23. SRB internal standard conjugate (29) 1 .2mg 28 was added to 2 mL of 100mM 
Tris/10mM K4gCI2, pH 7.3 buffer with stirring. 15 units recombinant 
(3-D-galactosidbse (Sigma) was added, and after 12 hours the mixture was purified 
by reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: 
H 2 0 (0.08%TFA)/ACrHq ; 08%TFA)). Yield 0.7 mg. 

Clinical Substrate Synthesis for Sanfilippo Syndrome, type D (a sulfatase 
deficiency). 

25 24. p-Acrylamidophenyl-a-D-N-acetylglucosamine-6-sulfate (30) 100mg 

(0.28mmol) 20 was added to 10 mL dry DMF under argon atmosphere with stirring 
at room temperature. 89mg (0.56mmol) sulfur trioxide-pyridine complex was 
dissolved in 2 mL dry DMF and was added to the reaction in 0.7x, 1.1x, 1.3x and 

51 




1.9x equivalents (+700 mcL, +400 mcL, +200 mcL, and +600 mcL). The reaction 
progress was monitored by 1 H-NMR shift of the anomeric (C1 ) proton chemical shift 
from 5.29 to 5.24 ppm by removal of 1 5 mcL of solution 1 hour after addition of 
each amount of sulfating reagent. The removed mixture was dried by vacuum 
5 centrifugation and redissolved in d-6 DMSO and analyzed. Upon the appearance of 
more than two forms (starting material and C-6 sulfate) of the C1 anomeric proton, 
the reaction was removed to -20°C and stored. The product was purified by vacuum 
centrifugation to remove solvent, followed by reverse-phase HPLC (Vydac C-18 
prep-scale column, 6 mL/min. Mobile phase: H 2 0 (0.08%TFA)/ ACN (0.08%TFA)). 
10 Yield 72%. 

UJ 

jO 25. Michael addition product of 18 and 30 (31) 25mg (0.058mmol)30 was added 
Q to a stirred solution of 83mg (0.35mmol) 18 in 5 mL 0.2M sodium carbonate, pH 10.5 
^ at 37°C. The reaction was allowed to proceed for 3 days, after which the solution 
CO was neutralized with dilute trifluoroacetic acid and purified by reverse-phase HPLC 
fs (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 (0.08%TFA)/ ACN 
O (0.08%TFA)). Yield 10 mg. 

5 26. SFD substrate conjugate of 4 and 31 (32) A 5.7mg (0.018mmol) quantity of 

^ 4 was dissolved in 1.0 mL anhydrous DMF with stirring, under argon atmosphere. 20 
mcL dry triethylamine was added, followed by 5.5mg (0.020mmol) 1 . The formation 

20 of active ester was monitored by silica TLC (5:1 CHCI 3 /CH 3 OH, Rf 0.5, UV) by briefly 
drying the spotted TLC plate with a stream of air. After 25 minutes, the mixture was 
added to 10mg (0.015mmol) 31 in 1 mL anhydrous DMF. After 2 hours, the solvent 
was removed by vacuum centrifugation and the final product was purified by 
reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 

25 (0.08%TFA)/ ACN (0.08%TFA)). Yield 5.4 mg. 

27. 1,2,14,15-octadeutero-1,15-diamino-5,8,1 1-trioxapentadecane (33) as 
referenced in Polyether Diamine Linker Synthesis, Second Generation. 
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28. Deuterated analog of 31 (34) 25mg (0.07mmol) 20 was added to a stirred 
solution of 1 0Omg (0.4mmol) 11 in 5 mL 0.2M sodium carbonate, pH 1 0.5 at 37°C. 
The reaction was allowed to proceed for 3 days, after which the solution was 
neutralized with dilute trifluoroacetic acid and purified by reverse-phase HPLC 
(Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 (0.08%TFA)AACN 
(0.08%TFA)). Yield 7 mg. 

29. SFD internal standard conjugate (35) A 4mg (12.6mcmol) quantity of 4 was 
dissolved in 1 mL anhydrous DMF with stirring, under argon atmosphere. 20 mcL 
triethylamine was added, followed by 4mg (14mcmol) 1 . The formation of active 
ester was monitored by silica TLC (5:1 CHCI 3 /CH 3 OH, Rf 0.5, UV) by briefly drying 
the spotted TLC plate with a stream of air. After 20 minutes, the mixture was added 
to 7mg (1 Imcmol) 34 in 1 mL anhydrous DMF. After 4 hours, the solvent was 
removed by vacuum centrifugation and the final product was purified by 
reverse-phase HPLC (Vydac C-18 prep-scale column, 6 mL/min. Mobile phase: H 2 0 
(0.08%TFA)/ ACN (0.08%TFA)). Yield 2. 7 mg. 

N-(d-Biotinyl-sarcosinyl)-12-aminododecanoic acid (36). Compound 4 (32.2 
mg, 0.102 mmole) was dried overnight in vacuo (with P 2 O s ). Dry DMF (2 mL) was 
added and the mixture was stirred with warming to affect dissolution under nitrogen. 
Triethylamine (34 mcL) was added followed by 1 (20.4 mcL, 0.115 mmole) added 
in two 10.2 mcL portions, 5 min apart. The mixture was stirred for 1 hr at room 
temperature under nitrogen. 12-Aminododecanic acid (24.1 mg, 0.112 mmole, 
Sigma) was added in one portion, and the mixture was stirred at room temperature 
for 2 hr under nitrogen. CHCI 3 (80 mL) was added, and the organic solution was 
washed with two 10 mL portions of 1 M HCI. CHCI 3 was removed by rotary 
evaporation, and residual DMF was removed by vacuum centrifugation. The 
compound was dissolved in methanol and purified by HPLC (Vydac 218TP prep 
column). Solvent program is: 0-10 min, water with 0.06% TFA; 10-55 min, 0-100% 
methanol with 0.06% TFA, flow rate is 6 mL/min. Yield 31 .7 mg. 1 H-NMR. ESI-MS, 
calculated 513.4, observed 513.4 (M + H) + 
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N-hydroxysuccinimidyl ester of 36 (37). Compound 36 (9.8 mg, 19 mcmole) 
is dissolved in 100 mcL of dry DMF under nitrogen. N-hydroxysuccinimide (2.2 mg, 
19 mcmole) was added followed by dicyclohexylcarbodiimide (3.9 mg, 19 mcmole). 
The mixture was stirred at room temperature for 60 h in the dark. Solvent was 
5 removed by vacuum centrifugation, and the residue was submitted to flash 
chromatography on silica gel using a gradient of CHCI 3 /CH 3 OH (15/1) to 
CHCI3/CH3OH (12/1). Yield 9.8 mg. 1H-NMR. ESI-MS, calculated 610.8, observed 
609.7 (M+H) + . 

N-(N-(d-Biotinyl-sarcosinyl)-12-aminododecanoyl)-pyschosine (38). 
10 Compound 37 (6.2 mg, 10 mcmole) and pyschosine (4.7 mg, 10 mcmole, Sigma) 
%p were dissolved in 200 mcL of dry DMF under nitrogen. Diisoproylethylamine (5 mcL) 
y was added, and the mixture was stirred under nitrogen for 2 days in the dark. The 
compound was injected directly onto the HPLC column (Vydac 218TP semi-prep), 
CD and the column was developed at 2 mL/min with 0-20 min, water with 0.06% TFA, 
f5 then 20-80 min, 0-100% methanol with 0.06% TFA. Yield 3.8 mg. 1H-NMR. 
Q ESI-MS, calculated 957.3, observed 956.8 (M + H) + . 

g N-(N-(d-Biotinyl-sarcosinyl)-12-aminododecanoyl)-sphingosylphosphorylcholine 

N 8 (39). Sphingosylphosphorylcholine (4.0 mg, Sigma) was mixed with 1 mL dry DMF 
and solvent was removed by vacuum centrifugation. This was repeated two more 

20 times. The final dried residue weighed 2.5 mg (5.4 mcmole). To this residue was 
added 3.3 mg of 37 (5.4 mcmole), 150 mcL of dry DMF, and 2.5 mcL of 
diisoproylethylamine. The mixture was stirred under nitrogen in the dark for 3 days. 
The compound was injected directly onto the HPLC column (Vydac 218TP 
semi-prep), and the column was developed at 2 mL/min with 0-20 min, water with 

25 0.06% TFA, then 20-80 min, 0-100% methanol with 0.06% TFA. Yield 3.8 mg. 
1H-NMR. ESI-MS, calculated 960.3, observed 958.7 (M + H) + . 

Conjugate of d-biotin with 1,13-diamino-4,7,10-trioxatridecane (40). 
Compound 2 was reacted with 1 ,13-diamino-4,7,10-trioxatridecane (Fluka) 
essentially as described for the synthesis of 3. The product was purified by HPLC 
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(Vydac 218TP, semi-prep) using 0-100% methanol with 0.06% TFA over 30 min at 
1.5 mL/min. 

lodoacetylated 40 (41). Compound 40 was treated with 5 equivalents of 
iodoacetic anhydride (Aldrich) in dry DMF with stirring under nitrogen for 4 h at room 
temperature. The product was purified on HPLC as for 40. The structure was 
confirmed by ESI-MS. 

Octadeuterated 41 (42). The title compound was prepared as for the 40 
using 9 instead of 1,13-diamino-4,7,10-trioxatridecane. 

Octadeuterated 42 (43). The title compound was prepared from 42 as for 41 . 
The structure was confirmed by ESI-MS. 
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Exemplary MS N Techniques and Instrumentation 

\n automated LC-MS/MS system for the identification of proteins by their 
amino acid sequence has been developed. A schematic representation is shown in 
Fig. 7 Thevsystem, which consists of an autosample, a capillary HPLC system 
5 connected oto-line to an ESI triple quadrupole MS/MS instrument and a data system 
is operated in the following way: Proteins (typically separated by 1D or 2D gel 
electrophoresis) Tare cleaved with a specific protease, usually trypsin, the resulting 
cleavage fragment^ are placed in an autosampler. Every 37 minutes the 
autosampler injects one sample into the HPLC system and the peptides are 
10 separated by capillary reverse-phase chromatography. As separated peptides elute 
^ from the chromatography\column, they are ionized by the ESI process, enter the MS 
%y and the mass to charge ratio (m/z) is measured. Any peptide ion whose intensity 
2 exceeds a predetermined intensity threshold is automatically selected by the 
^ instrument and collided in the collision cell with inert gas. These collisions result in 
(E5 peptide fragmentation, primarily at the bonds of the peptide backbone (collision 
7" induced dissociation, CID). The masses of the CID fragments are measured and 
y recorded in the data system. The Cl^\spectrum of a peptide contains sufficient 
[U information to identify the protein by seaVching sequence databases with the 
S uninterpreted MS/MS spectra. This is accomplished with the Sequent program. The 
J?f) program identifies each peptide in a sequence database which has the same mass 
as the peptide that was selected in the MS for 6JD and predicts the MS/MS 
spectrum for each one of the isobaric peptides. By matching the experimentally 
determined CID spectrum with computer generated\heoretical CID spectra, the 
protein from which the observed peptide originated is identified. The system is 
25 capable of analyzing protein samples in a fully automated fashion at a pace of less 
than 40 min. per sample. Since each peptide represents ato independent protein 
identification and usually multiple peptides are derived from one protein, protein 
identification by this method is redundant and tolerant to proteihs co-migrating in a 
gel. The system is well suited for the detection and characterization of modified 
30 residues within polypeptide chains. The LC-MS/MS technique anaSautomated 

analysis of the generated CID spectra can be used for the methods of this invention. 
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Identification of proteins at sub-femtomole sensitivity by solid-phase 

extraction capillary electrophoresis tandem mass spectrometry (SPE- 
CE-MS/MS) 

^Protein identification by this method is based on the same principle as 
described above, except that peptide separation and ionization are performed at 
significantly higher sensitivity. Fig. 8 shows a schematic representation of the key 
design elements. The design of the system and its mode of operation have been 
published. Peptides derived from protein digests are concentrated by SPE, 
separated by CE^and analyzed by ESI-MS/MS. The resulting uninterpreted CID 
spectra are used tcrsearch sequence databases with the Sequest software system. 
The SPE extraction device is a small reversed-phase chromatography column of the 
dimensions 0.18 x 1 mmSwhich is directly packed in a fused silica separation 
capillary. Peptides contained in a sample solution are adsorbed and concentrated 
on the SPE device, eluted inWi estimated 100 - 300 nl of organic solvent and further 
concentrated by electrophoreticystacking and/or isotachophoresis to an estimated 
volume of 5-30 nl. The peptides sare then separated by CE in a 20 pm or 50 pm i.d. 
capillary and directly ionized by ESl\as they leave the capillary. With this system, 
peptide masses can be determined ala sensitivity of 660 attomoles (approx. 500 fg 
for a 20 residue peptide) at a concentration limit of 33 amol/pl and that proteins can 
be identified by the CID spectra of automatically selected peptides at less than 10 
fmol (0.5 ng for a protein of 50 kDa) of sample at a concentration limit of less than 
300 amol/pl. this technique is used for the analysis at very high sensitivity of the 
peptide samples generated by the experiments\ It has also been demonstrated that 
the analysis time available for automated CID experiments can be significantly 
extended by data-dependent modulation of the CE Voltage. If several peptide ions 
are detected coincidentally in the MS, the CE voltage\is automatically dropped. This 
results in a reduction of the electroosmotic flow out of the capillary and therefore in 
an extension of the time period available for selecting peptide ions for CID. The net 
effect of this peak parking technique is an extension of theVlynamic range of the 
technique because the increased time available is used for QID of ions with a low ion 
current Once all the peptide ions are analyzed, electrophoresis is automatically 
reaccelerated by increasing the CE voltage to the original value\ 
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JLE 1 . Relative, redundant quantitation of a-lactalbumin abundance (after mixing with 
kn^wn amount of the same protein with cysteines modified with isotopically heavy 
lylating reagent) 



Peptide 
# \ 


m/z 
(light) 


Charge 
state 


Peptide Sequence 


Ratio 
(heavy: light) 


1 


^18.4 


2+ 


(K) IWCK 


2.70 


2 


56(8.4 


2+ 


(K) ALCSEK (SEQ ID NO: 2) 


2.68 


3 


57(^4 


2+ 


(K) CEVFR (SEQ ID NO:3) 


2.90 


4 


760.5\ 


2+ 


(K) LDQWLCEK (SEQ ID NO:4) 


2.82 


5 


710.1 


\ 3+ 


(K) FLDDDLTDDIMCVK (SEQ ID NO:5) 


2.88 


6 


954.2 


\ 3+ 


(K) DDQNPHSSNICNISCDK (SEQ ID 
NO:6) 


2.90 


7 


1286.9 


4 \ 


(K) GYGGVSLPEWVCTTFHTSGYDT 

QAIVQNNDSTEYGLFQINNK (SEQ 
IDNO:7) 
\(SEQ ID NO. 


NA a 



10 



a Isotope ratio was not analyzed Isecause on a 4" peptide the isotope patterns were highly 
overlapping due to differences of only 2 amu between heavy and light ions. 
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TABLE 2. Sequence identification and quantitation of the components of a protein mixture in 
a single analysis. 



Gene Name* 


Peptide sequence identified 


Observed 

ratio 
(d0/d8) f 


Mean±SD 


Expected 

ratio 
(d0/d8) _ 


% 
error 


LCA_BOVIN 


ALC#SEK (SEQ ID NO:8 
C#EVFR 

FLDDLTDDIMC#VK (SEQ 
ID NO:9) 


0.94 
1.03 
0.92 


0.96±0.06 


1.00 


4.2 


OVAL_CHICK 


ADHPFLFC#IK (SEQ ID 
NO: 10) 

YPILPEYLQC#VK (SEQ ID 
NO: 11) 


1.88 
1.96 


1.92±0.06 


2.00 


4.0 


BGALECOLI 


LTAAC#FDR (SEQ ID 
NO: 12) 

IGLNC#QLAQVAER (SEQ 
ID NO: 13) 

HFDGVNSAFHLWC#NGR 
(SEQ ID NO: 14) 


1.00 
0.91 

1.04 


0.98±0.07 


1.00 


2.0 


LACBBOVTN 


WENGEC#AQK (SEQ ID 
NO: 15) 

LSFNPTQLEEQC#HI (SEQ 
IDNO:16) 


3.64 
3.45 


3.55±0.13 


4.00 


11.3 


G3P_RABIT 


VPTPNVSWDLTC#R 
(SEQ ID NO: 17) 
IVSNASC#TTNC#LAPLAK 
(SEQ ID NO: 18) 


0.54 
0.57 


0.56±0.02 


0.50 


12.0 


PHS2_RABIT 


IC#GGWQMEEADDWLR 
(SEQ ID NO: 19) 
TC#AYTNHTVLPEALER 
(SEQ ID NO:20) 
WLVLC#NPGLAEEAER 
(SEQIDNO:21) 


0.32 
0.35 
0.30 


0.32±0.03 


0.33 


3.1 



* Gene names are according to Swiss Prot nomenclature ( www.expasv.chV 
+ Ratios were calculated for each peptide as shown in Fig. 3. 

= Expected ratios were calculated from the known amounts of proteins present in each mixture. 

# ICAT-labeled cysteinyl residue. 
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TABLE 3. Protein profiles from yeast growing on galactose or ethanol as a carbon source. 



Gene 
name* 


Peptide sequence identified 


Observed 

ratio* 
(Eth:Gal) _ 


Galactose- 
repressed 6 


Glucose- 
repressed 8 




ACH1 


KHNC#LHEPHMLK (SEQ ID NO:22) 


>100 : 1 






ADH1 


YSGVC#HTDLHAWHGDWPLPVK (SEQ 
ID NO: 23) 

C#C#SDVFNQWK (SEQ ID NO:24) 


0.57 : 1 
0.48 : 1 






ADH2 


YSGVC#HTDLHAWHGDWPLPTK (SEQ 
ID NO:25) 

C#SSDVFNHVVK (SEQ ID NO:26) 


>200 : 1 
>200 : 1 






ALD4 


TFEVINPSTEEEIC#HIYEGR (SEQ ID 
NO:27) 


>100 : 1 






BMH1 


SEHQVELIC#SYR (SEQ ID NO:28) 


0.95 : 1 






CDC 19 


YRPNC#PIILVTR (SEQ ID NO:29) 
NC#TPKPTSTTETVAASAVAAVFEQK 
(SEQ ID NO:30) 
AC#DDK 


0.49 : 1 
0.65 : 1 
0.67 : 1 






FBA1 


SIAPAYGIPVVLHSDHC#AK (SEQ ID 
NO:31) 

EQVGC#K (SEQ ID NO:32) 


0.60 : 1 
0.63 : 1 






GAL1 


LTGAGWGGC#TVHLVPGGPNGNIEK 
(SEQ ID NO:33) 


1 : >200 






GAL 10 


HHIPFYEVDLC#DR (SEQ ED NO:34) 
DC#VTLK (SEQ ED NO: 35) 


1 : >200 
1 : >200 






GCY1 


LWC#TQHHEPEVALDQSLK (SEQ ED 


0.34 : 1 






GLK1 


IC#SVNLHGDHTFSMEQMK (SEQ ID 

J / ) 


0.65 : 1 






urJJ l 




0 54 • 1 




s 


TPT 1 




>1 00 • 1 

1 \J\J . L 






TOO 1 

Lrr l 


Mr'^FPMMOVTT-TNrvn A FPOTWFDPNV^- 
1n v^Trrrririvj I -LTTin i KJr\r r\£ i vv lz,lj± in v o 

HPETK (SEQ ED NO:40) 


0 76 • 1 






LPDl 


VC#H AHPTL SEAFK (SEQ ED NO:41) 


1.30 : 1 






PEP4 


KGWTGQYTLDC#NTR (SEQ ED NO:42) 


2.60 : 1 






PSA1 


SWLC#NSTEK (SEQ ED NO:43) 


0.56 : 1 
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PGM2 


C#TGGIILTASHNPGGPENDMGIK (SEQ 
ID NO:44) 

LSIC#GEESFGTGSNHVR (SEQ ID 
NO:45) 


0.58 : 
0.62 : 


1 
1 


y 


PCK1 


C#PLK 

IPC#LADSHPK (SEQ ID NO:46) 
C#ENLSAEKEPEIFDAIK (SEQ ID NO:47) 
r#AYPTDYTPSAK fSEO TD N0481 
1VEEPTSKDEIWWGPVNKPC#SER 
(SEQIDNO:49) 


1.59 : 
1.47 : 
1.52 : 
1.41 : 
1.85 : 


1 
1 
1 
1 

1 


s 


QCR6 


ALVHHYEEC#AER TSEO ID NO 50) 


1.30 : 


1 


/ 


RPL1A 11 


SC#GVDAMSVDDLKK (SEQ ID NO:5 1) 


0.82 : 


1 




SAH1 


HPEMLEDC#FGLSEETTTGVHHLYR 
(SEQ ID NO:52) 

EC#INIKPQVDR (SEQ ID NO:53) 


0.62 : 
0.74 : 


1 
1 




SOD1 


GFHIHEFGDATNGC#VSAGPHFNPFK 
(SEQ ED NO:54) 


0.46 : 


1 


s 


TEF1 


RGNVC#GDAK (SEQ ID NO:55) 
C#GGDDK (SEQ ID NO:56) 
FVPSKPMC#VEAFSEYPPLGR CSEO ID 
NO:57) 


0.81 : 
0.70 : 
0.74 : 


1 
1 

: 1 




VMA2 


E?IFSASGLPHNEIAAQIC#R (SEQ ID 
NO:58) 


0.70 : 


1 




YHB1 


HYSLC#SASTK (SEQ ID NO:59) 


0.69 : 


: 1 





* Gene names are according to the Yeast Proteome Database (YPD) (19). 

# Cysteinyl residue is ICAT-labeled. 

+ Protein expression ratios were calculated as described in Fig. 3. 

= Carbon source for yeast growth was 2% ethanol (Eth) or 2% galactose (GAL0. 

§ Gene is known to be galactose- or glucose-repressed (19). 

11 Eight other ribosomal proteins were detected at similar gene expression levels. 
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TABLE -4 



[Z 



Disease 



Enzyme 



Dysfunction 



Butyrylcholinesterase deficiency 


BCHE 


Decreased or absent 
enzyme activity 


Essential fructosuria hepatic 
fructokinase deficiency 


Fructokinase 


Deficient enzyme activity 


Hereditary fructose intolerance 


Fructose 1,6- 
bisphosphate aldolase B 


Deficient enzyme activity 


Hereditary fructose 1 ,6- 
diphosphatase deficiency 


Fructose 1 ,6- 

hicnhnQnh Ata^P 
UloUl lUojJi i o iaoc 


Deficient enzyme activity 


Erythrocyte aldolase deficiency 
with nonspherocytic hemolytic 
anemia (aldolase A deficiency) 


Fructose 1,6- 

hicnhncnhafp alrinl^^P A 


Deficient enzyme activity 


Glycogen storage disease type 
la (von Gierke disease) 


/-*h i/>acq R.nhncnhstp^p 
OiUCQoc D pi HJo[JI Icalcjoc 


Absent or deficient 
enzyme activity 


Glycogen storage disease type 
lb 


oiucose o-privjopiiciic 
translocase 


npfiripnt transoort of 
glucose 6-phosphate 
across the membrane of 
endoplasmic reticulum 


Glycogen storage disease type 
III 


Amylo-1, 6-glucosidase 
(debrancher enzyme) 


Absent or deficient 
enzyme activity 


Glycogen storage disease type 
IV (Andersen disease) 


Ct- 1 , *r giUCal l-U-U- 

glucosyltransferase 


npfiripnt enzvme activitv 


Glycogen storage disease type 
V (McArdle disease) 


Muscle glycogen 
phosphorylase 


Ah<;pnt or deficient 
enzyme 


Glycogen storage disease X- 
linked phosphorylase kinase 
deficiency 


Knospnoryiase u-Kinaoo 


npfiripnt or absent 
enzyme activity function 


Glycogen storage disease 
autosomal phosphorylase 
kinase deficiency 


Phosphorylase 6-kinase 


Deficient enzyme activity 


Glycogen storage disease liver 
phosphorylase deficiency 


Liver phosphorylase 


Deficient enzyme activity 


Glycogen storage disease type 
VII (Tarui disease) 


Muscle 
phosphofructokinase 1 


Deficient enzyme activity 


Liver glycogen synthase 
deficiency 


Liver glycogen synthase 


Unknown 
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TABLE -4 



Disease 



Enzyme 



Dysfunction 



Phosphoglycerate kinase 
deficiency 


Phosphoglycerate kinase 


Deficient enzyme 


Phosohoalvcerate mutase 
deficiency 


Phosphoglycerate 
mutase 


Deficient enzyme 


Muscle lactate dehvdroqenase 
deficiency 


Muscle-specific subunit of 
lactate dehydrogenase 
(LDH) 


Absence of M subunit of 
LDH. Muscle LDH is a 
tetramer of the heart- 
specific subunit 


r^h irncip nhn<;nhate isomerase 
deficiency 


Glucose phosphate 
isomerase 


Unknown 


Tran^fprase deficiencv 
galactosemia 


Galactose 1 -phosphate 
uridyltransferase 


Deficient enzyme activity 


f^alantokina^e deficiencv 
galactosemia 


Galactokinase 


Deficient enzyme activity 


Fnimpra<;p dpficiencv 

galactosemia 


Uridine diphosphate 
galactose-4-epimerase 


Deficient enzyme action 

in blood cells only 
(benign) or, more rarely, 
in all tissues (generalized) 


Phenylketonuria (PKU) due to 
PAH deficiency 


Phenylalanine 
hydroxylase (PAH) 


Deficient or absent PAH 
activity (<1% normal) 


Mwn^rnhpnvlal^ninemia due to 
DHPR-deficiency 


Dihydropteridine 
reductase (DHPR) 


Deficient or absent DHPR 
activity 


Hyperphenylalaninemia due to 

r^TP-f^H-HpfiHpnrv 


Guanosine triphosphate 
cyclohydrolase (GTP-CH) 


Deficient enzyme activity 


Hyperphenylalaninemia due to 
6-PTS-deficiency 


6-Pyruvoyl 
tetrahydropterin synthase 
(6-PTS) 


Deficient enzyme activity 


Oculocutaneous tyrosinemia 
(tyrosinemia type II; tyrosine 
amino-transferase deficiency) 


Tyrosine 
aminotransferase 


Decreased activity 


4-Hydroxyphenylpyruvic acid 
dioxygenase (tyrosinemia type 
III) 


4-Hydroxy-pheny I pyruvic 
acid dioxygenase 


Decreased activity 


Maleylacetoacetate isomerase 
deficiency (tyrosinemia type lb) 
(tentative) 


Maleylacetonacetate 
isomerase 


Presumably decreased 
enzyme activity 
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TABLE -4 



Disease 1 


Enzyme 1 


Dysfunction | 






Hepatorenal tyrosinemia 
(tyrosinemia type 1: 
fumarylacetoacetate hydrolase 
deficiency) 


Pi irn^rvl^pf^tnacetate 

rui I id i y lauciuao^ 

hydroxylase 


Deficient enzyme activity 


Carbamyl phosphate synthetase 
deficiency 


Carbamyl phosphate 
synthetase 1 


Absent or deficient 
enzyme activity 


Ornithine transcarbamylase 
deficiency 


Ornithine 
transcarbamylase 


Absent or reduced 
enzyme activity 


Argininosuccinic acid 
synthetase deficiency 


Argininosuccinic acid 
synthetase 


Deficient enzyme activity 


Argininosuccinase deficiency 


Argininosuccinate lyase 


Deficient enzyme activity 


Arginase deficiency 


Liver arginase 


Deficient enzyme activity 


raminai nypt?iiyointjiiuca ^vanaiu. 
saccharopinuria) 


a-Aminoadipic 
semialdehyde synthase 


Deficient enzyme activity 


Maple syrup urine disease 
(MSUD) or branched chain 
ketoacidemia 


Branched-chain a-keto 
acid dehydrogenase 


Deficient or absent (<2%) 
BCKAD complex activity 

in mitocnonarta, 
immunologic absence or 
reduced levels of enzyme 
subunits; impairment of 
E1 subunit assembly 


Cystathionine p-synthase 
ripficiencv 


Cystathionine p-synthase 


Deficient enzyme activity 


rf-Cv«;tathionase deficiency 


cc-Cystathionase 


Deficient enzyme activity 


Hepatic methionine 
adenosyltransferase deficiency 


Isoenzyme of methionine 
adenosyltransferase 


Deficient enzyme activity 


Sarcosinemia 


Sarcosine 
dehydrogenase? 


Deficient enzyme activity 


Nonketotic hyperglycinemia 


Glycine cleavage system 


Deficient enzyme activity 


Hyperuracil thyminuria 


Dihydropyrimidine 
dehydrogenase 


Deficient enzyme activity 


Dihydropyrimidinuria 


Dihydropyrimidinase 


Unknown 


Pyridoxine dependency with 
seizures 


Brain glutamic acid 
decarboxylase-1 


Deficient coenzyme 
binding? (brain) 
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TABLE -4 



Disease 1 


Enzyme I 


Dysfunction 




GABA aminotransferase 
deficiency 


GABA-cc-ketoglutarate 
transaminase 


Deficient enzyme activity 


4-Hydroxybutyric aciduria 


Succinic semialdehyde 
dehydrogenase 


Deficient enzyme activity 


Serum carnosinase deficiency 
and homocarnosinosis 


Serum carnosinase 


Deficient enzyme 


Alkaptonuria 


Homogentisic acid 
oxidase 


Absent or deficient 
enzyme activity 


Isovaleric acidemia 


Isovaleryl-CoA 
dehydrogenase 


Deficient enzyme activity, 
uencieni proiciii, 
abnormal peptide size 


Isolated 3-methylcrotonyl-CoA 
carboxylase deficiency 


3-Methylcrotonyl-CoA 
carboxylase 


Deficient enzyme activity 


3-Methylglutaconic aciduria 
Mild form: 


3-Methylglutaconyl-CoA 
hydratase 


Deficient enzyme activity 


3-Hydroxy-3-methylglutaryl-CoA 
lyase deficiency 


o nyui uAy j 

methylglutaryl-CoA lyase 


Deficient enzyme activity 


Mevalonic aciduria 


Mevalonate kinase 


Deficient enzyme activity 


Mitochondrial acetoacetyl-CoA 
thiolase deficiency 


Mitochondrial 
acetoacetyl-CoA thiolase 
(T2) 


Deficient enzyme activity, 
decreased protein, 
unstable protein 


Propionic acidemia (2 nonallelic 

fnrmc HoctnnatpH nc^C'A 3nd 

pccBC) 


Propionyl-CoA 
carboxylase (PCC) 


Deficient enzyme activity 
(nonallelic forms reflect 
mutations in nonidentical 
subunits of PCC) 


Methylmalonic acidemia (2 
allelic variants designated mut° 
and mut 


Methyldmalonyl-CoA 
mutase (MUT) 
apoenzyme 


Absent MUT activity in 

mut , deficient mu i 
activity due to reduced 

affinity for cofactor 
(adenosylcobalamin) in 
mut 


Glutaric acidemia type 1 


Glutaryl-CoA 
dehydrogenase 


Deficient enzyme activity 


Cytochrome oxidase deficiency 


Cytochrome oxidase 
polypeptides 


Decreased activity of the 
cytochrome oxidase 
complex 
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TABLE -4 



Disease 



Enzyme 



Dysfunction 



Pyruvate dehydrogenase 

complex deficiency^ 
decarboxylase component 


Pyruvate decarboxylase, 
E,a 


uecreasea enzyme 
activity, decreased 
protein 


Pyruvate dehydrogenease E 2 
transacylase 


Dihydrolipoamide 
transacylase 


Decreased enzyme 
activity; abnormal protein 
electrophoretic mobility 


Combined cc-ketoacid 

dehydrogenase 
deficiency/lipoamide 
dehydrogenase deficiency 


Lipoamide 
dehydrogenase 


Decreased enzyme 
activity 


Pyruvate carboxylase deficiency 


Pyruvate carboxylase 


Absent enzyme activity; 7 
cases absent enzyme, 
protein, and mRNA 


Carnitine palmitoyl transferase 1 
(CPT 1) deficiency 


Carnitine palmitoyl 
transferase 1 


Deficient enzyme 


Carnitine/acylcarnitine 
translocase deficiency 


Carnitine/ acylcarnitine 
translocase 


Deficient translocase 


Parnitinp nalmitovl transferase II 
(CPT II) deficiency 


Carnitine palmitoyl 
transferase II 


Deficient enzyme 


Vprv Inna-chain acvl-CoA 
dehydrogenase (VLCAD) 
deficiency 


Very long-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 


Long-chain acyl-CoA 
dehydrogenase (LCAD) 


Long-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 


1 nnn-rhain L-3-hvdroxvacvl- 
CoA dehydrogenase (LCHAD) 
deficiency 


L-3-hydroxyacyl-CoA 
dehydrogenase 


Deficient enzyme 


Trifunctional enzyme (TFE) 
deficiency 


Trifunctional enzyme 


Deficient enzyme 


Dienolyl-Co reductase 
deficiency 


2,4-dienoyl-CoA 
reductase 


Deficient enzyme 


Medium-chain acyl-CoA 
dehydrogenase (MCAD 
deficiency 


Medium-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 


Short-chain acyl-CoA 
dehydrogenase (SCAD) 
deficiency 


Short-chain acyl-CoA 
dehydrogenase 


Deficient enzyme 
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TABLE -4 



Disease 



Enzyme 



Dysfunction 



Glutaric acidemia type II 


Electron transfer 
flavoprotein (ETF); 
ETF:ubiquinone 
oxidoreductase 


In some cases, no 
enzyme antigen; in 
others, no enzyme 
activity 


Glycerol kinase deficiency (Gkd) 


Glycerol kinase 


The microdeletion 
involves not only GK but 
also the other deleted 
loci: AHC, DMD, OTC, 
and other linked loci 


Primary gout: superactive 
variant of 
phosphoribosylpyrophosphate 
(PP-ribose-P) synthetase 


PP-ribose-P synthetase 


Enhanced enzyme 
activity 


Primary gout: partial deficiency 
of hypoxanthine guanine 
phosphoribosyltransferase 
(HPRT) 


Hypoxanthine guanine 
phosphoribosyl 
transferase (HPRT) 


Absent or deficient 
enzyme activity 


Lesch-Nyhan syndrome: 
deficiency of hypoxanthine 

guanine 
phosphoribosyltransferase 
(HPRT) 


Hypoxanthine guanine 
phosphoribosyltrans- 
ferase (HPRT) 


Absent or deficient 
enzyme activity 


2,8-Dihydroxyadenine lithiasis 
(adenine 

nhn^nhnriho^vltransf erase 

deficiency) 


Adenine 
phosphoribosyltransferas 
e 


Type 1: absent enzyme 
activity; type II: reducted 
affinity for PP-ribose-P 


Adenosine deaminase 
deficiency with severe combined 
immunodeficiency disease 


Adenosine deaminase 


Absent or greatly 
diminished enzyme 
activity 


Purine nucleoside 
phosphorylase deficiency with 
cellular immunodeficiency 


Purine nucleoside 
phosphorylase 


Ausent or greauy 
diminished enzyme 
activity 


Myoadenylate deaminase 
deficiency 


Myoadenylate deaminase 
(AMPDI) 


No enzyme activity; on 
immunoreactive protein 


Xanthinuria 


Xanthine dehydrogenase 
(xanthine oxidase) 


Type 1: absent xanthine 
dehydrogenase activity; 
type II: absent xanthine 

dehydrogenase and 
aldehyde oxidase activity 
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TABLE -4 



Disease 



Enzyme 



Dysfunction 



Hereditary orotic aciduria 


UMP synthase 


Deficient enzyme activity 
(unstable protein) 


Pvrimidine S'-nucleotidase 
deficiency 


Pyrimidine 5'- 
nucleotidase 


Absent or unstable 
enzyme 


DihvdroDvrimidine 
dehydrogenase deficiency 


Dihydropyrimidine 
dehydrogenase 


Absent or unstable 
enzyme 


nihvHroDrvimidase deficiency 

Illy \aJ 1 |^ 1 V 1 1 III wiww » i ^-^ * 1 ■ J 


Dihydropyrimidase 


Absent or unstable 
enzyme 


Familial liooDrotein lioase 

I Cl 1 1 Jill CI 1 II W Ly 1 W III U p'^ 4 

deficiency 


Lipoprotein lipase 


Nonfunctional protein in 
some, nondectable 
enzyme activity and 
nrotein in others 


Familial lecithin: chloesterol 
acyltransferase deficiency 


Lecithin:cholesterol 
acyiiroiioici doc 


Absent enzyme protein or 
ripficient enzvme activity 

V^i> 1 1 Ul w lie \* I l^~ j i i • » « *j 


5-Aminolevulinic acid 
dehydratase porphyria 


5-Aminolevulinic acid 
uenyui oiaot; 


Minimal enzyme activity 


Acute intermittent porphyria 


Porphobilinogen 

/-i ^ r"v">i n^c /**v 


Decreased enzyme 
activitv f~50%) 


Congenital erythropoietic 
porphyria 


Uroporphyrinogen III 
cosyninasc 


Minimal enzyme activity 


Porphyria cutanea tarda (familial 
form) 


Uroporphyrinogen 
GecarDOxyiabc 


Decreased enzyme 
activitv f-50%) 


Hepatoerythropoietic porphyria 


Uroporphyrinogen 
aecaruoxyiase 


Minimal enzyme activity 


Hereditary coproporphyria 


Coproporphyrinogen 
oxidase 


Decreased enzyme 
activity (-50%) 


Variegate porphyria 


Protoporphyrinogen 
oxidase 


Decreased enzyme 
activity (-50%) 


Erythropoietic protophorphyria 


Ferrochelatase 


Decrease enzyme activity 
(-50%) 


Crigler-Najjar syndrome, type 1 


Bilirubin UDP- 
glucuronosyltransferase 


Absent enzyme activity 


Crigler-Najjar syndrome, type II 


Bilirubin UDP- 
glucuronosyltransferase, 


Markedly reduced 
enzyme activity 
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TABLE -4 



Disease 



Enzyme 



Dysfunction 



Gilbert syndrome 


Bilirubin UDP- 
glucuronosyltransferase 
activity 


Reduced enzyme activity 


Refsum disease 


Phytanic acid cc- 
hydroxylase 


Deficient enzyme activity 


Primary hyperoxaluria type 1 


Alanine-glyoxylate 
aminotransferase 


Loss of enzyme catalytic 
activity and aberrant 
subcellular distribution 


Primary hyperoxaluria type 2 


Glyoxylate reductase/D- 
glycerate dehydrogenase 


Loss of enzyme catalytic 
activity 


G M2 gangliosidosis: 
hexosaminadase a-subunit 
deficiency (variant B, Tay-Sachs 
disease) 


^-hexosaminidase 


hexosamininidase A (a|3) 
activity 


Glycogen storage disease type 
II 


a-glucosidase 


Absent or deficient 
enzyme activity 


Mucopolysaccharidosis 1 
(Hurler, Scheie, and Hurler- 
Sheie syndromes, MPS, MPS 
IS, MPS IH/S) 


a-L-iduronidase 


Absent enzyme activity 


Mucopolysaccharidosis II 
(Hunter syndrome) 


Iduronate sulfatase 


Absent enzyme activity 


Mucopolysaccharidosis III 
(Saniflippo syndrome) types A, 
B, C and D 


MIA- Hpnaran A/-<?Li!fatase 

IIIB: a-N-acetyl- 
glucosaminidase 
IIIC: Acetyl-GoA: a- 
glucosaminide 
acetyltransferase 
HID: /V-acetyl- 
glucosamine-6-sulfatase 


Absent enzyme activity 


Mucopolysaccharidosis IV 
(Morquio syndrome) types A 
and B 


IVA: Galactose 6- 
sulfatase 
IVB: p-Galactosidase 


Absent enzyme activity 


Mucopolysaccharidosis VI 
(Maroteaux-Lamy syndrome) 


/V-acetyl-galactosamine 
4-sulfatase 


Absent enzyme activity 


Mucopolysaccharidosis VII (Sly 
syndrome) 


p-glucuronidase 


Absent enzyme activity 
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TABLE -4 



Disease 1 


Enzyme I 


Dysfunction | 




l-celt disease (ML-ll) 


N-acetylglucosaminyl-l- 
phosphotransferase 


Phosphorylation of many 
lysosomal enzymes 


Schindler disease (a-/V-acetyl- 
galactosaminidase deficiency) 


a-/V-acetyl- 
galactosaminidase 


Deficient activity of a-N- 
acetylgalactosaminidase 


a-Mannosidosis 


cc-D-mannosidase 


Deficient or unstable 
enzyme activity 


p-Mannosidosis 


P-D-mannosidase 


Deficient enzyme activity 


Sialidosis 


a-neuraminidase 


Deficient enzyme activity 


Aspartylglucosaminuria 


Aspartyglucosaminidase 


Deficient enzyme activity 


Fucosidosis 


a-L-fucosidase 


Deficient enzyme activity 


Wolman disease and cholesteryl 
ester storage disease 


Acid lipase 


Deficient enzyme activity 


Ceramidase deficiency (Farber 
lipogranulomatosis) 


Ceramidase 


Deficient enzyme activity 


Niemann-Pick disease (NPD) 
types A and B (primary 
sphingomyelin storage) 


opningomyeiiricioc 


Deficient 
sphingomyelinase activity 


Gaucher disease type 1 
(nonneuronopathic) 


Glucocerebrosidase 


Decreased catalytic 
activity and some 
instability of enzyme 
protein 


Globoid-cell leukodystrophy 
(Krabbe disease) 


Galactosylceramidase 


Absent enzyme activity 


Metachromatic leukodystrophy 


Arylsulfatase A 


Deficient enzyme activity 


rdui y uiocctoc 


a-Galactosidase A 


Nonfunctional or unstable 
enzyme protein 


G M1 gangliosidosis 


Acid P-galactosidase 
(GLBI) 


Deficient enzyme activity 


G M2 gangliosidosis: 
hexosaminidase cc-subunit 
deficiency (variant B, Tay-Sachs 
disease) 


P-hexosaminidase 


Absent or defective 
hexosaminidase A (aP) 
activity 


Steroid 21 -hydroxylase 
deficiency salt-losing form 


Steroid 21 -hydroxylase 


Absent or truncated 
enzyme with no activity 
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TABLE -4 



Disease 



Enzyme 



Dysfunction 



Steroid 5a-reductase 2 
deficiency 


Steroid 5ct-reductase 2 


Absent or unstable 
enzyme activity 


Steroid sulfatase deficiency (X- 
linked ichthyosis) 


3P-hydroxysteroid 
sulfatase 


Absent immunoreactive 
and enzymatically active 
nrofpin ^hnth deletion and 
nondeletion patients) 


Methylenetetrahydrofolate 
reductase deficiency 


Methylenetetrahydro- 
folate reductase 


Absent or deficient 
enzyme activity. 
Thermolabile variants 
have been described. 


Holocarboxylase synthetase 
deficiency 


Holocarboxylase 
synthetase 


Deficient holocarboxylase 
synthetase activity 


Biotinidase deficiency 


Biotinidase 


Deficient biotinidase 
activity 


Hereditary methemoglobinemia 

secondary to cytochrome b 5 
reductase deficiency, types 1, II, 
and III 


Cytochrome b 5 reductase 


Deficient enzyme activity 
in erythrocyte cytosol only 

ftvnfa |\ in all ti^^UP^ 
ilyijc it, ii • cm uoouco 

(type II), and in all 
hematopoetic cells (type 
III) 


Pyruvate kinase deficiency 
hemolytic anemia 


Pyruvate kinase 


Deficient enzyme activity 


Hexokinase deficiency 
hemolytic anemia 


Hexokinase 


Deficient enzyme activity 


Glucosephosphate isomerase 
deficiency hemolytic anemia 


Glucosephosphate 
isomerase 


Deficient enzyme activity 


Aldolase deficiency hemolytic 
anemia 


Aldolase (A type) 


Deficient enzyme activity 


Triosephosphate isomerase 
deficiency hemolytic anemia 


Triosephosphate 
isomerase 


Enzyme activity deficient 
in all tissues 


Phosphoglycerate kinase 
deficiency hemolytic anemia 


Phosphoglycerate kinase 


Deficient enzyme activity 
in hemizygotes 


2,3-Diphospho-glyceromutase 
and phosphatase deficiency 


2,3-Diphospho-glycerate- 
mutase and phosphatase 
(1 protein) 


Deficient enzyme activity 


6-Phosphogluconate 
dehydrogenase deficiency 


6-Phosphogluconate 
dehydrogenase 


Enzyme activity 
deficiency 
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TABLE -4 



Disease 



Enzyme 



Dysfunction 



Glutathione peroxidase 
deficiency 


Glutathione peroxidase 


Diminished enzyme 
activity 


Cjiutatnione reductase 
deficiency 


f^li itathione reductase 


Deficient enzyme activity 


Qjiutatnione synineiase 
deficiency hemolytic anemia 


nil itathinn^ svnthetase 


Deficient enzyme activity 


Y-olutamyicysieine synineiase 
deficiency hemolytic anemia 


v-fnh itamvlcvsteine 
synthetase 


Deficient enzyme activity 


Adenosine deaminase 
hyperactivity hemolytic anemia 


Adenosine deaminase 


Overproduction of 
structurally normal 
enzvme Drotein mediated 
at mRNA translation level 


Pyrimidine nucleotidase 
deficiency hemolytic anemia 


Pyrimidine nucleotidase 


Deficient enzyme activity 


Mvplnnproxidase deficiency 


Myeloperoxidase 


Absent or deficient 
enzyme activity 


Carbonic anhydrase II 
deficiency syndrome 
(osteopetrosis with renal tubular 
acidosis) 


Carbonic anhydrase II 


Quantitative deficiency of 
carbonic anhydrase II 


Albinism, oculocutaneous 
tyrosinase-negative type 
(OCAIA) 


Tyrosinase 


Absent, reduced, or 
unusual enzyme activity 


Canavan disease 


Aspartoacylase 


Deficient enzyme activity 
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TABLE 5. Molecular masses of protonated and sodiated substrate-conjugates, products, and internal 
standards for CDGS enzymes. 



m/z 



Enzyme 


Substrate 




Product 


internal 
standard 






(M + H)* 


(M + Na)* 


(M + H)* 


(M + Na)* 


(M + H) + 


(M + Na) + 


Type la.b 


711 


733 


549 


571 


555 


577 


Manose- 
transferase 


725 


747 


563 


585 


570 


592 


Type II 


1156 


1178 


1343 


1365 


1348 


1370 


Type 


1126 


1148 


2362 a 


2384 a 


2367 a 


2389 a 



a Calculated for the GlcNAc-T II product and internal standard containin a GlcNAc-GlcNAc-Mannose 



(Mannose-GlcNAc) 2 residue. 
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Scheme 1 




LINKER (heavy has 8 deuteriums) 



FUNCTIONAL GROUP 



Flowchart illustration of the procedure for quantitative protein profile 
measurements based on stable isotope ratios by mass spectrometry 



Sample protein mixture 



Reference Protein mixture 



Derivatize cysteines 
with light isotope reagent 




Derivatize cysteines 

with heavy isotope reagent 



Enzymatically cleave protein 



Affinity isolation of derivatized peptides 



Analysis and quantitation of derivatized peptides by LC-MS/MS 
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Scheme 2 



Custom chemical synthesis of sulfhydryl-reactive biotinylating reagent. Linker can 
made either isotopically heavy (d8) or light(dO). 
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Scheme 3 

Synthesis of reagent specific for biotinylation of free amino groups. The reagent is 
synthesized in both isotopically heavy (d8) or light (dO) form. 
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Scheme 4 




Scheme 5 

Structures of Substrate Conjugates 1 and 2, Product Conjugates 3 and 4, and Internal 
Standards 5 and 6 




= CH 2 0 = CD 2 
n = 0 
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Scheme 7 




(CH 2 )i2CH 3 



Galactosyl- 
Cerebroside 



Scheme 8 



Glucose 



ATP 



Glucose-6-P 



Mannose 



ATP 



Fructose-6-P 

t 

PMlb 
Mannose-6-P 



PMM2 



Dol-P-mannose 



7a 

GDP Dol-P 



GDP-mannose 



GTP 



Mannose- 1-P 



Dol = Dolichol 
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Scheme 9 




Scheme 10 




;ine 
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Scheme 1 1 




Scheme 12 




bf 3j Et 2 o 
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Scheme 1 2 (continued) 

OAC 




Scheme 13 




B-S-NH-Try-Gln-Ser-Asn-Ser-Thr-Met-NH 2 
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Scheme 15 
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Scheme 16 
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Scheme 17 
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Scheme 18 
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Scheme 22 
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